THE PSYCHOLOGICAL REVIEW


KNIGHT DUNLAP 
1875-1949 


Somewhere in the writings of Taine 
there is a passage which emphasizes the 
importance of cultivating the “taste and 
habit of work.” This striking phrase 
might be used as a text in describing 
the life of Knight Dunlap. He had the 
taste and habit of work. He had also 
the rich intellectual endowment which 
made his work yield important results 
in the fields of psychological teaching 
and research. 

Dunlap was a native son of Cali- 
fornia, born in Diamond Spring, and it 
was a satisfaction to him to spend the
later years of his academic service in 
his native state. During his training in 
the University of California he realized 
that he wanted to become an experi- 
mental psychologist. Psychology re- 
mained his dominant interest, and was 
pursued with a devotion and an in- 
tegrity that have been the inspiration 
of generations of students. As we all 
know, Dunlap was something of an 
iconoclast; great names did not over- 
awe him. But the name of the man 
who revealed to him the meaning and 
the possibilities of experimental psy- 
chology was one which he always men- 
tioned with a respect not short of rever- 
ence. That was the name of George 
Malcolm Stratton. 

After completing his doctorate at 
Harvard, Dunlap returned to Berkeley 
as instructor in psychology. From 
Berkeley he was called to The Johns 
Hopkins University where he served
from 1906 to 1936, and held the title
Professor of Experimental Psychology 
during the last 20 years. During 1917- 
1918 he served with the Army in the 
Air Medical Service. In 1936 he joined 
the faculty of the University of Cali- 
fornia at Los Angeles as Professor and 
Chairman of the Department of Psy- 
chology, and remained there until his 
retirement in 1946. 

In addition to his teaching and re- 
search Dunlap influenced the develop- 
ment of psychology by way of his edi- 
torial activities. He founded the Jour- 
nal of Psychobiology in 1917, and when 
that journal merged with the Journal of 
Animal Behavior to form the Journal of 
Comparative Psychology, the editorial 
responsibility was largely his. Other 
journals and monograph series likewise 
benefited by his help. 

Dunlap’s publications were numerous. 
His first article “The effect of imper- 
ceptible shadows upon the judgment of 
distance” appeared in this JOURNAL in
1900. His first book was A System of 
Psychology, published in 1912, and the 
last, in 1946, was Religion, Its Func- 
tions in Human Life. He was respon- 
sible for many experimental studies, 
several well known books, and the in- 
vention and design of apparatus. 

It is not, however, the quantity of a 
man’s output for which we remember 
him. Dunlap was original and stimu- 
lating to a high degree. Fortunately 
we have his own summary of his main
contributions, and those who are students
of psychological literature will
recognize the central importance of the 
problems with which he wrestled. He 
classifies his work as theoretical, ex- 
perimental and instrumental, and under 
the caption of theory we have the fol- 
lowing seven topics. 

1) The attack on introspection. 2) 
The insistence on response or reaction 
as the basis of mental processes, in- 
cluding thought processes. 3) The at- 
tack on images and the insistence on 
objects of perception and of thought, 
in general, as real objects, instead of 
“psychic” objects, “states of conscious- 
ness,” “sensations,” and “images.” 4) 
The emphasis on the periphery, in gen- 
eral, instead of on the brain, as the 
fundamental determiner of psychologi- 
cal qualities. 5) The elimination of 
“instincts” and the emphasis on desires. 
6) The view of consciousness as an ab- 
stract reference to an undeniable fact 
of response. 7) The reformation of
our views on heredity.

In elaborating these positions he
made many forceful and subtle comments.
Whether one agrees or not
with his theories there is no question
but that he stimulated the discussion
and advanced the understanding of vital 
psychological problems. In later years 
he gave much time to social, religious 
and clinical studies. In discussing child 
care he once said that “keeping the 
baby happily stimulated during its 
waking periods, and preventing crying, 
while not ‘spoiling’ the child, is a diffi- 
cult task, too difficult, perhaps, for the 
intelligence of many parents. Spoiling 
a child, however, is a minor evil; neg- 
lecting it is a major one.” 

Among his many memorable sen- 
tences the following could be taken as 
an important part of his creed. “As a 
psychologist, I have the psychologist’s 
prejudice, that ideals, intellectual analy- 
sis, and education are the fundamental 
forces of progress, and that laws, con- 
ventions, and customs serve to consoli- 
date and make secure the gains achieved 
through these forces.” 

Knight Dunlap was a man most 
generous of his time and thought, keen 
in analysis and criticism, and fertile in 
psychological invention. For the ten 
talents lent to him he made a rich re- 
turn. 

Kate Gordon Moore 


STATISTICAL BEHAVIORISTICS AND SEQUENCES 
OF RESPONSES 


BY GEORGE A. MILLER AND FREDERICK C. FRICK 
Harvard University 


This paper is concerned with pro- 
cedures that can be used to analyze 
serial dependencies in chains of re- 
sponses. In order to introduce these 
procedures we shall begin with some 
general opinions about experimenta- 
tion in modern psychology. These 
opinions comprise a point of view 
which we propose to call statistical 
behavioristics. This point of view 
represents a methodological bias that 
we believe is accepted by many ex- 
perimental psychologists; it has not, 
however, received explicit formulation 
nor has it been fully exploited. 

We have chosen to call this approach
behavioristics because behavior
seems to be the observable datum
of psychology. We have modified
the noun by the word statistical because
behavior has that kind of complexity
that seems to preclude, at
least for the present, a completely 
deterministic account. 


STATISTICAL BEHAVIORISTICS 


Most psychological experiments 
involve the classification of an organ- 
ism’s responses and the tabulation of 
the number of responses in each class. 
There are many difficulties concealed 
in this summary sentence—the de- 
cisions as to which responses to re- 
cord, which environmental changes 
to use in their classification, how large 
a sample is necessary, etc. In what 
follows it will be assumed that the 
psychologist has been able to limit 
his interest to certain portions of the 
behavior stream that he feels are 
peculiarly diagnostic, and that these 
portions can be recognized whenever
they occur. They have the status of
“responses” or “response units.”
The psychologist’s problem is to de- 
scribe the relative frequencies of oc- 
currence of responses in each of the 
classes he has set up. 

A psychophysical example. In
order to make the discussion more con- 
crete, consider an example from psy- 
chophysics. The subject pushes one 
key when a stimulus increases in 
magnitude, another key when the 
stimulus decreases in magnitude. All 
the subject’s responses during the 
session are not recorded, but only the 
subset of responses that involve 
pushing a key. This subset of re- 
corded responses, R, can be classified 
in two ways: (1) into subset $A$, the
responses to key $a$, and subset $B$, the
responses to key $b$, or (2) into subset
$S_1$, the responses to stimulus change
$s_1$, subset $S_2$, the responses to stimulus
change $s_2$, and subset $S_3$, the responses
to stimulus change $s_3$. In order to
correlate responses to both key and
stimulus, further subsets are formed
by the logical products (those elements
common to the two subsets):
$A \cap S_1$, the set of responses to key $a$
when $s_1$ was presented, $A \cap S_2$, the set
of responses to key $a$ when $s_2$ was
presented, etc. The final step is to
infer the differential limen from the
relative frequencies of responses in
each of the subsets.
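
The cross-classification just described can be sketched in a few lines of Python. This is only an illustration: the trial log, the labels "a", "b", "s1", "s2", "s3" and the variable names are invented here and are not data from the experiment.

```python
from collections import Counter

# Hypothetical trial log: (key pressed, stimulus change presented).
# The data are invented purely for illustration.
trials = [
    ("a", "s1"), ("a", "s1"), ("b", "s1"),
    ("a", "s2"), ("b", "s2"), ("b", "s2"),
    ("b", "s3"), ("b", "s3"), ("a", "s3"),
]

# Each (key, stimulus) cell corresponds to one logical product, e.g. A ∩ S1.
joint_counts = Counter(trials)
key_counts = Counter(key for key, _ in trials)
stimulus_counts = Counter(stim for _, stim in trials)

# Relative frequency of responses in each cell of the classification.
total = len(trials)
joint_freq = {cell: n / total for cell, n in joint_counts.items()}

for (key, stim), freq in sorted(joint_freq.items()):
    print(f"key {key}, stimulus {stim}: {freq:.3f}")
```

Because the classification is unordered, tabulating by stimulus first and key second would give the same six cells.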

The classification is shown graphically
in Fig. 1. Fig. 1A represents
each response by a dot, and responses
included in each subset are surrounded
by the subset contours. Fig. 1B
gives the classification in the form of
a tree, where the classes become more
specific as we ascend the tree. The
classification can also be given by the
headings on a statistical table that
summarizes the data (cf. Table I).

[Fig. 1. Two graphical representations for the classification of
responses in a hypothetical psychophysical experiment. Labels in
the figure: all responses made in experimental situation;
(R) recorded responses; responses to key a and to key b in
presence of stimuli $S_j$; the subsets $A \cap S_j$ and $B \cap S_j$.]

The classification is fundamental 
because it must contain, along with 
the definitions of the classes, all the
information that is available for the 
analysis of the experiment. If the 
classification is complete, there is no 
further information about any ele- 
ment of S. The classification says 
that any element of S possesses certain
specified properties, and to that
extent the information narrows the
search for the particular element to 
the set of all those elements of S that 
possess these properties. If the re- 
sults of an experiment can be pre- 
sented in a table they can be described 
in these general terms. The basic 
observations are concerned with the 
relative frequencies of responses in 
different classes. 

It may be convenient to summarize 
the distribution of responses in the 
different classes by a single statistic 
and to describe how this statistic 
varies with other dimensions of the
situation. This extension complicates
the classification rather than
changing its fundamental nature, and 
usually neglects some of the informa- 
tion. 

The course of action. It is impor- 
tant to notice that the psychophysical 
classification is unordered. Responses
were classified first on the
basis of the key depressed, then on the 
basis of the stimulus presented. 
Classification in the reverse order 
would have served the purpose equally 
well. Unordered classifications have 
been the rule in psychological re- 
search, and most of the familiar 
statistical procedures are designed to 
handle such classifications. 

There are situations, however, where
the order of classification is definitely 
specified. For example, geneticists 
build classifications that include the 
relation “parent of,” and successive
classes cannot be interchanged. 

An ordering relation that is of
particular interest to the psychologist
is “precedes in time.” This relation
gives sequential information that has 
been largely neglected in the classifi- 
cations of responses. In the psycho- 
physical experiment just described, 
for example, the successive judgments 
were embedded in the time contin- 
uum, but no record of the order was 
kept and the serial information is not 
available in the classification shown 
in Fig. 1. 

In order to include the sequential 
information, the first step is to decide 
what responses to record. In Fig. 1 
the recorded responses are defined 
according to key and stimulus inten- 
sity into six subsets—refer to these 
now as responses A, B, C, D, E, and 
F. The next step is to distinguish 
the set of responses of set A that were 
immediately followed by responses of 
set A; the set of responses of set A 
that were followed by responses of set
B; the set of responses of set A that
were followed by responses of set C; 
and so on until all possible digrams 
are listed. In this example there will 
be 36 digrams. 

By distinguishing the various possi- 
ble pairs of responses another step is 
added to the classification of Fig. 1. 
As many further steps can be added 
as necessary—just how far we need to 
proceed will be discussed later. The 
set of recorded responses is partitioned 
into A, B, C, .... Each of these classes
is partitioned into AA, AB, AC, ...,
BA, BB, BC, ..., CA, CB, CC, etc.
Then each of these is further partitioned
into AAA, AAB, AAC, ..., ABA,
ABB, ABC, ..., ACA, ACB, ACC, etc.
Sequences of four, five or m responses 
in a row can be tabulated to preserve 
the sequential information. 
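
A sequential tabulation of this kind might be sketched as follows. The helper name `tabulate_ngrams` and the short record are assumptions made for illustration; overlapping windows are used, so a record of N responses yields N - n + 1 sequences of length n.

```python
from collections import Counter
from itertools import product

def tabulate_ngrams(responses, n):
    """Count every run of n successive responses (overlapping windows)."""
    return Counter(
        tuple(responses[i:i + n]) for i in range(len(responses) - n + 1)
    )

# Invented record using three of the response classes.
record = list("ABCABAACBB")

digrams = tabulate_ngrams(record, 2)
trigrams = tabulate_ngrams(record, 3)

# With six response classes there are 6 * 6 possible digrams.
possible_digrams = list(product("ABCDEF", repeat=2))

print(len(possible_digrams), sum(digrams.values()), sum(trigrams.values()))
```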

Ignoring the sequential information 
is equivalent to assuming that suc- 
cessive responses are independent. 
If this assumption is correct there is 
little information to be gained from a 
sequential tabulation. According to 
the assumption of independence, if 
the relative frequency of response A 
is p(A) = 0.5, and of response B is 
p(B) = 0.5, we could assume that the 
relative frequencies of AA, of AB, of 
BA, of BB are all 0.25, and of triads 
are 0.125, etc. If an actual tabula- 
tion of sequences shows significant 
departures from these values, then the 
assumption of independence is not 
justified, and the frequency distribu- 
tion for sequences of m responses can- 
not be derived from the distribution 
for individual responses. 
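
The comparison can be illustrated with a small sketch. The strictly alternating sequence below is invented so that the departure from independence is extreme; a real analysis would also need a significance test, which is not shown.

```python
from collections import Counter
from math import prod

def expected_under_independence(p, ngram):
    """Product of the single-response frequencies of the symbols in ngram."""
    return prod(p[symbol] for symbol in ngram)

# Invented, strictly alternating record: p(A) = p(B) = 0.5.
sequence = "AB" * 10
p = {s: c / len(sequence) for s, c in Counter(sequence).items()}

# Observed relative frequencies of the four possible pairs.
pairs = Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))
n_pairs = sum(pairs.values())
observed = {pair: pairs[pair] / n_pairs for pair in ("AA", "AB", "BA", "BB")}
expected = {pair: expected_under_independence(p, pair) for pair in observed}

for pair in observed:
    print(pair, "observed", round(observed[pair], 3), "expected", expected[pair])
```

Here every pair is expected with frequency 0.25 under independence, yet AA and BB never occur, so the single-response distribution alone would badly misdescribe the record.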

It is easy to see that a sequential 
classification can be exhaustive. To 
take a specific example, suppose the 
experiment defines two responses,
0 and 1, and that the sample consists of 
sequences of ten such responses from 
each subject. The possible number 
of different sequences of ten responses
with two alternatives is $2^{10}$, or 1024.
If the sequential classification is con- 
tinued out to sequences of ten re- 
sponses, the classification will include 
all the 1024 courses of action that are 
possible in the experimental situation.¹
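
As a rough sketch, the exhaustive classification can be enumerated directly, and each course of action can be recorded as a single integer by reading the sequence as a binary number. The function names `encode` and `decode` are illustrative only.

```python
from itertools import product

# Every possible course of action: ten responses, each either 0 or 1.
sequences = ["".join(bits) for bits in product("01", repeat=10)]

def encode(sequence):
    """Read a string of 0s and 1s as a binary number."""
    return int(sequence, 2)

def decode(number, length=10):
    """Recover the response sequence from its decimal code."""
    return format(number, f"0{length}b")

print(len(sequences), encode("0001011101"), decode(93))
```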

In summary: Psychologists define 
classes of responses, determine their 
relative frequencies of occurrence and 
attempt to discover how the frequency 
distributions change as the experi- 
mental variables are manipulated. 
Within this broad area of statistical 
behavioristics lie problems that in- 
volve sequences of responses, or 
courses of action. The problem is to 
predict the frequencies of these courses 
of action, just as individual response 
frequencies are predicted in the non- 
sequential case. 


APPROXIMATE REPRESENTATION OF 
THE COURSE OF ACTION 


The psychologist’s experiments usually
generate a sequence of symbols:
right and wrong, conditioned and
unconditioned, left and right, slow and
fast, adient and abient, etc. The
analysis must begin with this system 
of symbols that the experimenter 
writes in his records. The record 
might consist of R’s for correct re- 
sponses in a learning experiment and 
W’s for errors, and the analyzer would 
be faced with something like this: 


RWWWWRWRWRWWWRWRWWWR, etc.


In the first twenty responses there are 
0.35 correct responses and 0.65 errors. 

¹ A sequence given in 0s and 1s can be
treated as a number in the binary system and
translated to the corresponding number in the
more familiar decimal system. For example,
0001011101 is the equivalent of 93. Thus the
decimal numbers from 0 to 1023, when jotted
on the record, preserve the complete sequential
information in the example discussed.

Now consider the mathematical
models that might be taken from the
theory of probability and used to
summarize these observed frequencies. 
In the simplest model each symbol in 
the sequence leaves the probabilities 
unaltered. The occurrence of the 
symbol returns the system to the 
same state, and the distribution of 
probabilities is the same after the
symbol occurs as it was before. This 
formal model is presented graphically 
in Fig. 2. Here the arrows represent 
the occurrence of a symbol, the dot 
represents the single state of the 
system, and both arrows return the 
system to the original state. 


Fig. 2. Diagrammatic representation of a
sequence of independent events. An arrow 
is traversed when the event occurs, and both 
events return the process to the same original 
state. 


Dependent probabilities. The for- 
mal model depicted in Fig. 2 con- 
siders successive symbols as independ- 
ent events. This model would de- 
scribe the results obtained by drawing 
marbles (and replacing them) from an 
urn that contained the appropriate 
proportions of R and W marbles. 

So simple a model may not be 
sufficient. It may be necessary to 
consider sequential dependencies, 
which is done by counting the occur- 
rences of pairs of items. For the 
record given above such a count shows 
seven occurrences of WW, six occur- 
rences of RW, six occurrences of WR, 
and no occurrences of RR. The re- 
sults can be presented in the form 
shown in Table I. 
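
The pair count can be checked with a short sketch. It assumes the twenty-response record reads RWWWWRWRWRWWWRWRWWWR, which is the reading that agrees with the frequencies quoted in the text; the variable names are illustrative.

```python
from collections import Counter

# Assumed reading of the printed record (20 responses).
record = "RWWWWRWRWRWWWRWRWWWR"

# Relative frequencies of the single responses.
single = Counter(record)
freq = {symbol: n / len(record) for symbol, n in single.items()}

# Overlapping pairs: 20 responses give 19 pairs.
pairs = Counter(record[i:i + 2] for i in range(len(record) - 1))

print(freq)
print(dict(pairs), "RR:", pairs["RR"])
```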

When confronted with four-fold
tables of this sort, psychologists
naturally turn to a contingency measure
of the degree of association between
the arguments. Since these
techniques are familiar, the following
discussion will be restricted to the
novel, but related, methods of analysis
that have been developed as a part
of the mathematical theory of
communication (4).

TABLE I
SHOWING THE DISTRIBUTION OF PAIRS OF RESPONSES

                          First response
Second response    Right           Wrong           Total
Right              p(RR) = 0       p(WR) = 6/19    p(R) = 6/19
Wrong              p(RW) = 6/19    p(WW) = 7/19    p(W) = 13/19
Total              p(R) = 6/19     p(W) = 13/19    1.0

Note that the relative frequencies 
of R and W are not the same when R 
has just occurred as they are when W 
has just occurred. If this difference 
is to be reflected in the mathematical 
model, the model must have two dis- 
tributions of probabilities—one dis- 
tribution that applies following the 
symbol R, and another distribution 
that applies following the symbol W. 


Fig. 3. Diagrammatic representation of
the second-order approximation of a sequence
of dependent events. The event R always
leaves the process in one state, while the event
W leaves it in the other.


Or, phrased differently, the system 
must have two states, one state that 
follows R and another state that fol- 
lows W. The occurrence of a symbol 
does not always return the system to 
the same state, and thus the mathematical
model admits dependent
probabilities. This formulation is
shown graphically in Fig. 3.


As in Fig. 2 the dots in Fig. 3 rep- 
resent the states of the system, and a 
symbol is generated whenever an 
arrow is traversed. There are several 
probabilities to be stated for the 
model in Fig. 3. There are the prob- 
abilities that the organism will get 
into one state or the other, p(W) and 
p(R). There are the probabilities of 
the several pairs of responses, p(WW),
p(WR), p(RW) and p(RR). And
there are the transitional probabilities,
$p_W(W)$, $p_W(R)$, $p_R(W)$ and $p_R(R)$,
that when the organism is in a given
state (indicated by the subscript) the
next response will leave the organism
in the same state or change it to a new
state (indicated by the argument).
All these probabilities are related in a
straightforward way:

$$p(W) + p(R) = p(WW) + p(WR) + p(RW) + p(RR) = 1,$$
$$p_W(W) + p_W(R) = p_R(W) + p_R(R) = 1,$$
$$p(WW) = p(W)\, p_W(W).$$
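
These relations can be checked numerically. The sketch below takes the pair probabilities of Table I as given and derives the state and transitional probabilities from them; exact fractions are used so that the identities hold without rounding, and the dictionary names are illustrative.

```python
from fractions import Fraction

# Pair probabilities p(XY): first response X, second response Y (Table I).
pair_prob = {
    "RR": Fraction(0, 19), "RW": Fraction(6, 19),
    "WR": Fraction(6, 19), "WW": Fraction(7, 19),
}

# Probability of being in each state: sum over the pairs that begin with it.
state_prob = {
    s: sum(p for pair, p in pair_prob.items() if pair[0] == s) for s in "RW"
}

# Transitional probabilities p_s(next) = p(s next) / p(s).
transition = {
    s: {nxt: pair_prob[s + nxt] / state_prob[s] for nxt in "RW"} for s in "RW"
}

print(state_prob)
print(transition)
```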


The mathematical systems repre- 
sented in these diagrams are called 
finite, discrete Markoff processes, a 
special class of stochastic processes. 
A finite number of possible states of 
the process exist, $Q_1, Q_2, Q_3, \ldots, Q_n$. A
different distribution of probabilities 
holds for each of the several states, 
and which distribution applies de- 
pends upon the preceding events that 
got the system into its present state. 
Thus the states represent the residual
influence of the preceding events. A
sequence of infinite length can be 
generated by a process with a finite 
number of states, for the system can 
return to any state over and over 
again. 

A sequence of events can have inter- 
dependencies extending over several 
successive events. If such a series is 
represented by a Markoff process of 
the type shown in Fig. 2, it is a first- 
order approximation. If the prob- 
abilities of pairs of events are con- 
sidered, as in Fig. 3, it is a second- 
order approximation. When the 
analysis is extended to sequences of s 
events, it is the s-th-order approxima- 
tion. Most psychological experi- 
ments have not been analyzed beyond 
a first-order approximation, where 
successive responses are assumed to be 
independent. 

In general, if the classification distinguishes
$a$ mutually exclusive events,
the $s$-th-order approximation is made
up of $a^s$ transitional probabilities
$p_{ij\ldots}(k)$ (the probability that, given
the sequence $ij\ldots$, the event $k$ will
follow), and $a^{s-1}$ states corresponding
to the various sequences of $s-1$ events
that might precede the $s$-th event.

To summarize: 


$a^s$: The number of classes necessary
for a complete tabulation of $a$ alternatives
considered in sequences of $s$.

$a^{s-1}$: The number of states of the
Markoff process, the number of tabulated
antecedents of the $s$-th event in
the sequence of $s$.

$a^{s-1}(a-1)$: The number of degrees of
freedom in the tabulation. There are
$a^{s-1}$ different states, and each state has
associated with it $a-1$ degrees of freedom.
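
For a quick check of how fast these quantities grow, the three counts can be computed together. The function name `tabulation_sizes` is an assumption made for this sketch.

```python
def tabulation_sizes(a, s):
    """Return (classes, states, degrees of freedom) for a alternatives
    tabulated in sequences of length s."""
    classes = a ** s
    states = a ** (s - 1)
    degrees_of_freedom = states * (a - 1)
    return classes, states, degrees_of_freedom

# Third-order approximation with three alternative responses.
print(tabulation_sizes(3, 3))
# Second-order approximation with two alternatives (R and W).
print(tabulation_sizes(2, 2))
```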
These mathematical models generate
infinite sequences of events.
This would seem to rule out the Markoff
process as a model for the study of
learning, for during the course of
learning the relative frequencies
change progressively.² This is not a
new problem to the psychologist, for 
whenever statistical models based 
upon hypothetically infinite popula- 
tions are applied to behavior he is 
faced with the sampling problem. 
The samples must be large enough to 
allow the relative frequencies to ap- 
proach their limits, but short enough 
to insure that the process has not 
changed significantly from the begin- 
ning to the end. It should be noted, 
however, that the sampling difficulty 
is acute with the type of statistical 
model discussed here. For example, 
in a third-order approximation with 
three alternative responses the “behavior
space” is granulated into 27
classes. The proportion of responses 
in each class must have time to ap- 
proach the parametric value. If the 
sample consists of 20 responses (18 
sequences of three responses), it is 
clearly impossible to get an adequate 
sample for each of 27 classes. 

If the behavior system changes ap- 
preciably on each successive response, 
elaborate statistical description is 
limited. When the behavior has 
settled into a persistent pattern, large 
samples can be obtained before the 
system changes, and extensive analy- 
sis is possible. 

² Dr. Frederick Mosteller has suggested informally
that a broader interpretation is possible.
Learning might be regarded profitably
in terms of a higher-order process of the
Markoff type that has Markoff processes as
its component states.

From the consideration of dependent
probabilities it is clear that the
extensive form of classification, discussed
in the first section of this paper,
can be considerably abbreviated and
still be descriptively adequate. To 
review briefly: all the possible courses 
of action in a given experimental situation
can be represented by an extensive
classification based upon sequential
information. The problem
is to know which courses of action
are most frequent under a given set of
circumstances. If the successive re- 
sponses can be regarded as generated 
by a Markoff process, the probability 
for any complete course of action can 
be summarized in a relatively small 
number of transitional probabilities. 


AN INDEX OF BEHAVIORAL 
STEREOTYPY 


The next question concerns the ex- 
perimenter’s accuracy in predicting 
the successive responses of his subject. 
Predictability depends upon (1) the
range of alternative responses the subject
can make and (2) the relative
frequencies with which these alternatives
occur. A quantitative index of
predictability, or, as we shall call it
here, an index of behavioral stereotypy,³
must be based upon these parameters.
Such a statistic can be derived for the
mathematical models discussed above
(4).

The number of alternatives. Con- 
sider first the case where all classes of 
events are equally probable. If there 
are only a few alternative classes a 
prediction is more certain than if 
there are many alternatives. “Anything
can happen” is a common expression
of uncertainty. In other
words, the uncertainty U is a monotonic
increasing function of the number
of alternatives. Some freedom is
possible in the choice of this function,
but a logarithmic function is the most
convenient.

³ Not the best possible term, perhaps. We
use it to refer to the “orderliness” of behavioral
sequences, without specifying whether
the orderliness is due to learning, fixation or
lack of understanding. Just how this usage
is related to that current among social
psychologists is a matter for some debate.

This convenience can be demon- 
strated with a simple numerical ex- 
ample. Suppose a choice is made 
from among 10 equally probable 
alternatives, and that, having made 
this first choice, a second choice must 
also be made from among 10 equally 
probable alternatives. Two successive
choices from 10 alternatives make
possible 100 alternative pairs of
choices; when the alternatives are
equally probable and successive 
choices are independent, two choices 
from 10 alternatives are equivalent 
to one choice from 100 alternatives. 
If the uncertainty in these two situ- 
ations is represented by a linear func- 
tion of the number of alternatives, the 
first choice from 10 alternatives rep- 
resents 10 units of uncertainty, and 
the second choice adds 90 more. It 
is not obvious why the second event 
should be assigned nine times the un- 
certainty of the first, and it would be 
more satisfactory if a choice from 10 
alternatives always represented the 
same amount of uncertainty. In 
other words, we would like to have an
additive measure of uncertainty. 
This is realized if uncertainty is a 
logarithmic function of the number of 
alternatives. Then the uncertainty 
of the first event is log 10 and the un- 
certainty of the second event is log 
10; the total uncertainty for two suc- 
cessive events is 2 log 10, which is 
equal to log 100, the uncertainty of 
one choice from 100 alternatives. 
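
The additivity is easy to confirm numerically. In this sketch the function name `log_uncertainty` and the default constant k = 1 are assumptions; logarithms are taken to the base 10 as in the example.

```python
from math import isclose, log10

def log_uncertainty(alternatives, k=1.0):
    """Uncertainty of a choice among equally probable alternatives."""
    return k * log10(alternatives)

# Two successive independent choices from 10 alternatives each.
two_choices = log_uncertainty(10) + log_uncertainty(10)
# One choice from the 100 equally probable pairs.
one_choice = log_uncertainty(100)

print(two_choices, one_choice)
```

A linear measure would instead assign 10 units to the first choice and 100 to the pair, which is the asymmetry the logarithm removes.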

In general, if there are $a$ equally
probable ways an event can happen,
and if it occurs $s$ successive times, U is
constant so long as $a^s$ is constant.
It is desirable to have the uncertainty
of $a^s$ possibilities, $U(a^s)$, equal to $s$
times the uncertainty of $a$ alternatives,
$U(a)$. That is to say, it is convenient if

$$U(a^s) = s \cdot U(a).$$

Consider two equally uncertain situations,
where $a^s = b^t$. Take the
logarithm of both sides of this equation,

$$s \log a = t \log b.$$

For the two equally uncertain situations,

$$s \cdot U(a) = t \cdot U(b).$$

Substituting for $t/s$ and solving for
$U(a)$:

$$U(a) = \frac{U(b)}{\log b} \log a = K \log a = -K \log (1/a).$$


The uncertainty involved in a choice 
among a equally probable alternatives 
is proportional to the logarithm of a. 

Disproportionate probabilities. The 
preceding discussion is restricted to 
alternatives of equal probability. 
This formulation can be extended to 
include alternatives of different prob- 
abilities. A numerical example will 
illustrate the extension. Suppose the 
choice is from three alternatives that 
have probabilities of 0.2, 0.3 and 0.5. 
This can be transformed into an 
equivalent case with equal probabili- 
ties, for it is possible to imagine that 
this is the first of two successive
choices, and since the imaginary second
choice is at our disposal it can be
used to make all the pairs of choices
equally probable. After choosing $p_1 = 0.2$
there would then be a choice
between two equally likely alternatives;
after $p_2 = 0.3$ a choice among
three alternatives, and after $p_3 = 0.5$
a choice among five alternatives.
The compound probabilities for the
pairs of choices are then 0.1, and the
uncertainty of the double choices is
equivalent to the uncertainty of a
single choice among 10 equally probable
alternatives. This situation is
illustrated in Fig. 4.

Now calculate the uncertainties involved
in Fig. 4. In addition to the
uncertainty of the first choice,
$U(\tfrac{2}{10}, \tfrac{3}{10}, \tfrac{5}{10})$, 0.2 of the time there is a second
choice of uncertainty $U(\tfrac{1}{2}, \tfrac{1}{2})$, 0.3 of
the time a second choice of $U(\tfrac{1}{3}, \tfrac{1}{3}, \tfrac{1}{3})$,
and half the time a second choice of
$U(\tfrac{1}{5}, \tfrac{1}{5}, \tfrac{1}{5}, \tfrac{1}{5}, \tfrac{1}{5})$. Since we want an additive
measure of uncertainty, we are
interested in the sum of all these
terms. The sum is equal to the uncertainty
of a single choice among 10
equally probable alternatives. This
can be written as

$$U(\tfrac{2}{10}, \tfrac{3}{10}, \tfrac{5}{10}) + 0.2\,U(\tfrac{1}{2}, \tfrac{1}{2}) + 0.3\,U(\tfrac{1}{3}, \tfrac{1}{3}, \tfrac{1}{3}) + 0.5\,U(\tfrac{1}{5}, \tfrac{1}{5}, \tfrac{1}{5}, \tfrac{1}{5}, \tfrac{1}{5}) = U(\tfrac{1}{10}, \ldots, \tfrac{1}{10}).$$

Fig. 4. Showing how a choice among unequally probable events is transformed
by a second choice into a choice among equally probable pairs of events.

Four of the five terms in this equation
involve equally probable alternatives
with determinable uncertainty. So

$$U(\tfrac{2}{10}, \tfrac{3}{10}, \tfrac{5}{10}) = K \log 10 - 0.2K \log 2 - 0.3K \log 3 - 0.5K \log 5.$$

With logarithms to the base 10 the
answer is

$$U = K(1 - 0.06 - 0.14 - 0.35) = 0.45K.$$
Note that if the three alternatives 
had been equally likely, U would have 
been K log 3, or 0.48K. Any de- 
parture from equal probabilities re- 
duces the uncertainty. 
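
The arithmetic of this example can be reproduced with a brief sketch. It takes K = 1 and base-10 logarithms, and it compares the two-stage computation used in the text with the closed form, the negative sum of p log p, that the general derivation arrives at; the function name `uncertainty` is illustrative.

```python
from math import isclose, log10

def uncertainty(probabilities, k=1.0):
    """-k * sum(p * log10(p)) over the alternatives with p > 0."""
    return -k * sum(p * log10(p) for p in probabilities if p > 0)

# Closed form applied directly to the three unequal alternatives.
direct = uncertainty([0.2, 0.3, 0.5])

# Two-stage argument: log 10 minus the weighted second-choice uncertainties.
two_stage = log10(10) - 0.2 * log10(2) - 0.3 * log10(3) - 0.5 * log10(5)

# Three equally likely alternatives, for comparison.
equal = uncertainty([1 / 3, 1 / 3, 1 / 3])

print(round(direct, 2), round(two_stage, 2), round(equal, 2))
```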

The procedure used in the example
can be generalized to give a solution
for any set of disproportionate alternatives.
In a general notation
$p_i = n_i / \sum_i n_i$, where $p_i$ is the probability of
an event occurring in manner $i$, and
$n_i$ is the number of occurrences of
members of class $i$. Then

$$U + K \sum_i \frac{n_i}{\sum_i n_i} \log n_i = K \log \sum_i n_i,$$

where the second term on the left and
the term on the right side of the equation
represent the uncertainties for
the equally probable alternatives that
are used to obtain the solution. Now
solve the equation for U:

$$U = K \left( \log \sum_i n_i - \sum_i p_i \log n_i \right).$$

Multiply the constant $\log \sum_i n_i$ by
$\sum_i p_i = 1$, to obtain

$$U = K \left( \sum_i p_i \log \sum_j n_j - \sum_i p_i \log n_i \right) = -K \sum_i p_i \log \frac{n_i}{\sum_j n_j} = -K \sum_i p_i \log p_i.$$

This derivation is given for the field
of rational numbers, but if the $p_i$ are
in the field of irrationals, they may be
approximated by rationals. This approximation
requires the assumption
that probability is a continuous function.

An examination of the formula for 
U shows that U is the mean value, or 
mathematical expectation, of the function
$-K \log p_i$. In other words, when the
probabilities of the separate alternatives
are not equal, we use the expected
value of $-K \log p_i$.

The derivation gives a general ex- 
pression for uncertainty which is 
equivalent to the physicist’s equation 
for entropy. When the probabilities 
for the various alternatives are known, 
this formula can be used to compute 
the uncertainty with which any 
particular, individual response can be 
predicted in the absence of any knowl- 
edge about the preceding or following 
responses. 

The value of K in this equation 
amounts to a choice of a unit for the 
measure of uncertainty. It is con- 
venient to choose K in such a way 
that it equals 1 when the base of the 
logarithm is taken as 2. If the base 
2 is used the resulting units are binary 
digits, or more briefly bits. The 
maximum uncertainty in predicting 
one of two alternatives is one bit of 
uncertainty. Making s such decisions 
in a row gives s bits of uncertainty, 
since the total number of possible
alternatives is $2^s$, and $\log_2 2^s = s$. Of
course logarithms to the base 10 can 
be used, and the resulting units are 
decimal digits. A decimal digit is 
3.32 bits. To express the equation in 
bits when logarithms to the base 10 
are used, K must have the value 3.32. 
In the following discussion we will use 
the bit as the unit of uncertainty. 
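
The choice of unit can be illustrated with a short sketch. It assumes nothing beyond the formula above; the function and variable names are hypothetical. It shows that base-2 logarithms with K = 1 and base-10 logarithms with K of about 3.32 give the same number of bits.

```python
import math

def uncertainty(probs, base=2.0, K=1.0):
    """U = -K sum(p_i log p_i) for a chosen logarithm base and constant K."""
    return -K * sum(p * math.log(p, base) for p in probs if p > 0)

# Constant that converts base-10 results (decimal digits) to bits.
K_BITS_FROM_LOG10 = 1.0 / math.log10(2.0)

two_equal = [0.5, 0.5]
bits_direct = uncertainty(two_equal, base=2.0)
bits_via_log10 = uncertainty(two_equal, base=10.0, K=K_BITS_FROM_LOG10)

# s binary decisions in a row give 2**s alternatives, hence s bits.
s = 5
bits_for_s_choices = math.log2(2 ** s)

print(round(K_BITS_FROM_LOG10, 2), bits_direct, round(bits_via_log10, 6))
```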

Dependent probabilities. It has 
been assumed that a behavior se- 
quence can often be represented by a 
Markoff process with a definite num- 
ber of states. If the process is in any 
given state there are several possible
events that can follow. Consequently,
there is a degree of uncertainty
associated with each state. In
one state the next event may be very 
certain, while in another state the next 
event may be relatively uncertain. 
To represent the uncertainty for any
particular state, write $-\sum_j p_i(j) \log_2 p_i(j)$,
where $p_i(j)$ is the probability
that state $i$ will be followed by event
$j$, and the summation is for all the
possible $j$ that could occur. The uncertainty
for the whole process must
be the weighted average of the uncertainties
for each of the separate
states. The contribution of each
state to the average is weighted according
to the probability of occurrence
of the state, $P_i$, and so represents
the mean value or mathematical
expectation of $U$ for the several states.
Thus the average uncertainty per
response, when the $s$-th approximation
is used, is

$$U_s = -\sum_i P_i \sum_j p_i(j) \log_2 p_i(j).$$


This equation represents the average 
uncertainty in the prediction of the 
next event when the preceding s-1 
events are known. 

This equation is simplified by taking
advantage of the fact that $P_i p_i(j)$
represents the probability that the sequence
leading to state $i$ and also
event $j$ will occur together. This
probability is symbolized by $p(Q_i, R_j)$,
the probability that state (sequence)
$Q_i$ followed by event $R_j$ will occur.
Similarly, the probability that, given
sequence $Q_i$, $R_j$ will follow is indicated
by $p_{Q_i}(R_j)$. The equation is written
as follows:

$$U_s = -\sum_{i,j} p(Q_i, R_j) \log_2 p_{Q_i}(R_j).$$
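
The equivalence of the weighted-average form and the joint-probability form can be checked on a small case. The two-state process below is purely hypothetical (its state labels, events, and probabilities are invented for illustration); the sketch only shows that the two expressions give the same value.

```python
import math

# Hypothetical process: P[i] is the probability of state i, and
# p[i][j] is the probability that state i is followed by event j.
P = {"A": 0.6, "B": 0.4}
p = {
    "A": {"x": 0.9, "y": 0.1},
    "B": {"x": 0.5, "y": 0.5},
}

def state_uncertainty(dist):
    """Uncertainty in bits of the next event for one state."""
    return -sum(q * math.log2(q) for q in dist.values() if q > 0)

# Weighted-average form: sum over states of P_i times the state uncertainty.
u_weighted = sum(P[i] * state_uncertainty(p[i]) for i in P)

# Joint form: -sum over (i, j) of p(Q_i, R_j) log2 p_{Q_i}(R_j),
# where p(Q_i, R_j) = P_i * p_i(j).
u_joint = -sum(
    P[i] * p[i][j] * math.log2(p[i][j])
    for i in P
    for j in p[i]
    if p[i][j] > 0
)

print(round(u_weighted, 4), round(u_joint, 4))
```

In this invented case state A, where one event is very likely, contributes much less uncertainty than state B, where the two events are equally likely.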


A series of approximations to U is 
obtained by considering the statistical 
structure of sequences of 1, 2, 3, ..., $s$
events. Shannon (4) has shown that
$U_s$ is the uncertainty of the $s$th order
approximation, and as $s$ increases
without limit, $U_s$ approaches $U$.
Further, $U_s$ is a monotonic decreasing
function of $s$, and the uncertainty of
the next event when the preceding
$s - 1$ are known is never increased by a
knowledge of the statistical structure 
of events preceding that. Analysis 
of sequences of events longer than s 
cannot make the prediction less cer- 
tain, but may make it more certain. 

When the mathematical model is 
used to summarize behavioral data, 
$U_s$ represents the uncertainty with
which a response can be predicted
when only the preceding $s - 1$ responses
are known. It will be called the 
uncertainty per response. 
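
One way to estimate the uncertainty per response from a record of responses is sketched below. It is a minimal illustration under stated assumptions: relative frequencies in the tabulation are used as probabilities, the function name is hypothetical, and the record is an invented strict cycle through four exits rather than real data.

```python
import math
from collections import Counter

def uncertainty_per_response(responses, s):
    """Estimate U_s in bits from overlapping sequences of length s."""
    grams = Counter(
        tuple(responses[k:k + s]) for k in range(len(responses) - s + 1)
    )
    total = sum(grams.values())
    # Tabulate the preceding s-1 responses from the same table, so the
    # context frequencies are consistent with the joint frequencies.
    contexts = Counter()
    for gram, n in grams.items():
        contexts[gram[:-1]] += n
    # U_s = -sum p(Q, R) log2 p_Q(R)
    return -sum(
        (n / total) * math.log2(n / contexts[gram[:-1]])
        for gram, n in grams.items()
    )

# Hypothetical record: the subject always moves on to the next exit.
record = [1, 2, 3, 4] * 25
u1 = uncertainty_per_response(record, 1)
u2 = uncertainty_per_response(record, 2)
print(u1, abs(u2))
```

For this artificial record each exit is used equally often, so single responses are maximally uncertain, yet one preceding response fixes the next one completely.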

To compare results from different
experiments it is often convenient to
use the relative uncertainty per response.
For any situation the maximum
value of $U$ is computed by assuming
all responses to be independent
and equally frequent. If there
are two alternative responses, the
maximum uncertainty will be $-2(0.5 \log_2 0.5)$,
or 1.0. With four possible
responses the maximum uncertainty
is $-4(0.25 \log_2 0.25)$, or 2.0. In
general, $U_{\max} = -\log_2(1/a) = \log_2 a$,
where $a$ is the number of possible
alternatives. The relative uncertainty,
therefore, is given by the ratio
of $U_s$ to $U_{\max}$, and this ratio may vary
between 0 and 1.

We are now in a position to define 
an index of behavioral stereotypy: 


$$C_s = 1 - (\text{relative } U_s)$$


A value of $C_s$ of 0.50, for example, is
equivalent to the statement that half 
of the responses are determined and 
that half are maximally uncertain. 
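
The index can be written as a two-line computation. The sketch below assumes the uncertainty per response is already known in bits; the function names are hypothetical.

```python
import math

def max_uncertainty(a):
    """Maximum uncertainty in bits for a equally likely, independent alternatives."""
    return math.log2(a)

def stereotypy_index(u_s, a):
    """C_s = 1 - U_s / U_max for a possible alternatives."""
    return 1.0 - u_s / max_uncertainty(a)

c_random = stereotypy_index(2.0, 4)   # maximally uncertain responding
c_fixed = stereotypy_index(0.0, 4)    # completely determined responding
c_half = stereotypy_index(1.0, 4)     # half of the maximum uncertainty
print(c_random, c_fixed, c_half)
```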


AN EXPERIMENTAL ILLUSTRATION:
MULTIPLE-CHOICE BEHAVIOR 


The discussion in the preceding sections
can be crystallized in an experimental
example. In 1916 G. V.
Hamilton (1) reported a study of 
trial-and-error behavior in a multiple- 
choice situation and, fortunately, pub- 
lished complete data for many of his 
subjects. Hamilton used an appar- 
ently unsolvable multiple-choice prob- 
lem that required the subjects to find 
their way out of an enclosure from 
which there were four possible exits. 
The correct exit varied from trial to 
trial, so it was useless to try again the 
exit that had proved successful on the 
preceding trial. No other clue was
available. This apparatus was sup- 
posed to prevent the formation of a 
specifically adaptive habit and to 
provide an extensive sample of trial- 
and-error behavior. 

The question to ask of these data is 
how unpredictable or how stereotyped 
is trial-and-error behavior in this 
situation? Does a subject system- 
atically vary his responses to produce 
a random sequence of exit attempts, 
or does he settle down to a relatively 
routine and stereotyped pattern that 
is adequate though not always re- 
warded? 

When the data from all of Hamil- 
ton’s 7-year-old girls are taken to- 
gether the sample is large enough to 
permit reasonable analysis. As a 
sample of the records, successive exit 
attempts might run as follows, where 
the numbers designate the four possi- 
ble exits: 


1, 2, 3, 4, 3, 2, 1, 4, 3, 2, 3, 4, 2, etc. 


The record was tabulated for sequences
of three successive responses.
The first three responses constituted
an entry under the (1, 2, 3) category,
responses two through four made an
entry in the (2, 3, 4) category, responses
three through five made an
entry in the (3, 4, 3) category, etc.,
until the day’s trial was completed,
i.e., until the exit had been found ten
times. When four alternatives are
tabulated in sequences of three responses,
there are $4^3 = 64$ different
categories. The data for the five
girls consisted of slightly over 1000 responses
and so permitted a fair sample
for all 64 possible sequences. A
tabulation of sequences of four, however,
requires 256 possible classes, and
Hamilton’s sample is too small for
reasonable reliability.

[Footnote: It surprised us to discover how few psychological
studies have reported the actual sequential
information necessary for this analysis.
It seems to be the first information that
is thrown away when the data are summarized
for publication.]
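
The tabulation of overlapping sequences can be sketched as follows, using the sample record quoted above. The function name is hypothetical, and the sketch treats the record as one unbroken run rather than restarting the tabulation for each day's trial.

```python
from collections import Counter

def tabulate_sequences(responses, length=3):
    """Count every run of `length` successive responses, overlapping."""
    return Counter(
        tuple(responses[k:k + length])
        for k in range(len(responses) - length + 1)
    )

# The illustrative record of exit attempts given in the text.
record = [1, 2, 3, 4, 3, 2, 1, 4, 3, 2, 3, 4, 2]
triples = tabulate_sequences(record)

n_categories = 4 ** 3   # four exits, sequences of three
for triple, count in sorted(triples.items()):
    print(triple, count)
```

Thirteen responses yield eleven overlapping triples, which makes plain why about a thousand responses are needed before all 64 categories can be sampled fairly.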

From the tabulation of sequences of 
three responses we can, of course, 
determine the tabulation for sequences 
of two or one, since $p(1, 2) = p(1, 2, 1) + p(1, 2, 2)$
$+\, p(1, 2, 3) + p(1, 2, 4)$,
etc. The tabulation for the longer
sequence contains all the information 
given by tabulations of shorter se- 
quences. 
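
Collapsing a table of longer sequences into a table of shorter ones amounts to summing over the final response. A minimal sketch, with invented counts that are not Hamilton's data and a hypothetical function name:

```python
from collections import Counter

def marginalize(table):
    """Collapse counts of n-response sequences onto their first n-1 responses."""
    shorter = Counter()
    for seq, n in table.items():
        shorter[seq[:-1]] += n
    return shorter

# Hypothetical triple counts for illustration only.
triples = Counter({
    (1, 2, 1): 2,
    (1, 2, 2): 1,
    (1, 2, 3): 15,
    (1, 2, 4): 2,
    (2, 3, 4): 20,
})
pairs = marginalize(triples)     # counts for (1, 2), (2, 3)
singles = marginalize(pairs)     # counts for (1,), (2,)
print(dict(pairs), dict(singles))
```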

The first approximation for the behavior
of the five girls shows the
proportion of responses to door 1,
$p(1)$, to be 0.21, $p(2)$ to be 0.31, $p(3)$
to be 0.30, and $p(4)$ to be 0.18. For
individual responses the distribution
is almost uniform, with only a slight
tendency for the girls to favor the
middle two doors. When there are
four alternatives, the maximum value
of $U$ is $\log_2 4$, or 2 bits. The calculated
value of $U_1$ for these data is 1.96.
The relative uncertainty is, therefore,
1.96/2, or 0.98. The index of stereotypy
for the first approximation is
$C_1 = 0.02$. With no knowledge of
preceding responses any prediction of
the next exit to be tried is very uncertain.
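
The first-order figures can be reproduced directly from the four proportions reported for the girls. The variable names in this sketch are hypothetical; the proportions are those given in the text.

```python
import math

# Proportions of responses to each of the four doors, as reported.
p = {1: 0.21, 2: 0.31, 3: 0.30, 4: 0.18}

u1 = -sum(q * math.log2(q) for q in p.values())   # uncertainty in bits
u_max = math.log2(len(p))                          # 2 bits for four doors
relative = u1 / u_max
c1 = 1.0 - relative

print(round(u1, 2), round(relative, 2), round(c1, 2))
```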

When sequences of two responses 
are considered, however, the picture 
is quite different. Of the 16 possible 
ordered pairs of responses, six predominate:
(1, 2), (2, 3), (3, 4), (4, 3),
(3, 2) and (2, 1). These six response
pairs together comprise 80 per cent
of the tabulated pairs. They are 
made up of responses to adjacent 
exits—apparently it is unusual for a 
subject to walk past a door without 
stopping to try it, or to try the same 
door twice in succession. These tendencies
introduce a certain degree of
orderliness into the behavior, and $C_2$
is 0.27. With knowledge of only one
preceding response it is evident that 
trial-and-error behavior in this situ- 
ation is not unpredictable, but is 
stereotyped and relatively orderly.

It is interesting to compare the
seven-year-old girls with the single 
adult rat for which Hamilton pub- 
lished complete data. Inspection of 
the tabulations of pairs of responses 
shows a striking similarity between 
girls and rat. The same six pairs of
responses that the girls favored were
favored by the rat, and comprised
78 per cent of its responses. For the
rat $C_1$ was 0.06, and $C_2$ was 0.26. On
the basis of a second-order approximation
it is difficult to discover any
difference between the animal's and 
the girls’ adjustment to the situation. 
This in spite of the fact that it took 
the rat approximately twice as many 
responses to complete 100 successful 
exits as it took the average girl. The 
better adjustment of the girls is not 
revealed in the preceding analysis, 
and it is necessary to consider the 
third-order approximation before the 
difference becomes clear. 

The sequences of three responses 
show that the girls tended to make a 
third response different from either of 
the two preceding responses. There 
was no need, they discovered, to try 
again a response that was unsuccessful
the time before last. This tendency
appears in the form of greater
stereotypy in their behavior, and $C_3$
is 0.42. The rat, on the other hand,
was not as proficient in remembering
what he had done on the trial before
the last, and often fell into the unproductive
pattern of which 3, 2, 3, 2, 3,
2, etc., is an example. If a girl made
the responses 1, 2 it was almost certain
that her next response would be 3, but
the rat was no more apt to go on to 3
than he was to return to 1. $C_3$ for
the rat, therefore, was only 0.31. It
is unfortunate that too few data are 
available for a fourth-order approxi- 
mation, for the indications are that 
the difference between girls and rat 
would be even more apparent. 
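
The reported indices can be converted back into uncertainties in bits, which makes the comparison between girls and rat easier to read. The sketch below uses only the values of the index quoted in the text; the data structure and names are hypothetical.

```python
import math

U_MAX = math.log2(4)   # four exits, so 2 bits

# Index of stereotypy by order of approximation, as quoted in the text.
c_values = {
    "girls": {1: 0.02, 2: 0.27, 3: 0.42},
    "rat": {1: 0.06, 2: 0.26, 3: 0.31},
}

# U_s = U_max * (1 - C_s), rounded to two decimals.
u_values = {
    subject: {s: round(U_MAX * (1.0 - c), 2) for s, c in by_order.items()}
    for subject, by_order in c_values.items()
}

# Gain in stereotypy from knowing a second preceding response.
gain_girls = round(c_values["girls"][3] - c_values["girls"][2], 2)
gain_rat = round(c_values["rat"][3] - c_values["rat"][2], 2)

print(u_values, gain_girls, gain_rat)
```

The gain from the second to the third order is three times as large for the girls as for the rat, which is the difference described above.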

In Fig. 5 $C_s$ is plotted as a function
of s. It is clear that the girls respond 
to remotely preceding responses to a 
greater extent than does the rat. 
Thus the index of stereotypy gives a 
second important fact about the be- 
havior. Not only can we say how 
stereotyped the response sequence is, 
but we can estimate the length of the 
chain over which preceding responses 
have an effect. If one agrees with 
Hunter (2) that a response influenced 
by the penultimate of the preceding 
responses indicates a symbolic proc- 
ess, then Hamilton’s experiment shows 
that girls have a greater capacity for 
symbolic processes than do rats. 
This is not a startling conclusion, but 
it is comforting to find that the com- 
putations do not contradict the obvi- 
ous. 


THE STATISTICAL STRUCTURE 
OF BEHAVIOR 


The central idea in the preceding 
discussion is the course of action. If 
interest is confined to individual re- 
sponses and no attempt is made to 
describe and discuss sequences of re- 
sponses, experimental psychology may 
have difficulty in meeting many prob- 
lems posed in the clinical and social 
areas. Some such concept as the 
course of action seems inevitable.

[Figure 5. Ordinate: index of stereotypy. Abscissa: order of approximation.]

FIG. 5. The stereotypy of trial-and-error behavior. Data from Hamilton (1).

To define a course of action it is necessary
to consider all the alternative
courses of action that might occur.
It is with this set of alternatives, this
classification, that the behavior analyst
must work. He is faced, not
with individual responses or even with
individual sequences of responses, but
rather with a matrix of alternatives.
His job is to decide which ones of this
large set of alternatives are most frequent,
then to discover the rules for
manipulating these frequencies. The
problem is, therefore, to analyze the
statistical structure of the set of all
possible courses of action. Or, more
generally, to analyze the statistical
structure of behavior.

This line of thinking makes it obvious
that it is not necessary for a scientific
psychology that one course of
action be very frequent and all others
very infrequent. Such stereotyped
behavior seems unadaptive in most
situations. The job is to describe 
precisely the distribution of frequencies 
among the various alternatives. $C_s$,
the index of stereotypy, may be zero, 
but if it is determined carefully enough 
and the conditions that make it zero 
are thoroughly understood, we have 
the kind of predictability necessary 
for a science of behavior. 

In different studies the interest may 
be in particular aspects of the stat- 
istical structure, or in changes in 
particular aspects. In learning stud- 
ies, for instance, the criterion may 
often be set in terms of the length of 
the sequence, s, that can become in- 
terdependent. Learning to talk is 
perhaps the best example of such a 
behavioral change. The young child 
is capable of producing all the sounds 
involved in the language, but his job 
is to learn the useful sequences. The
predictability of any given sound may
not change appreciably, but the length 
of the response sequence that must be 
analyzed in order to give a close ap- 
proximation to the behavior is greatly 
extended. In other cases, learning 
may be directed toward greater pre- 
dictability—a change in C—without 
any significant change in the length of 
the sequence that must be analyzed. 
Learning to hit a target, for example, 
is a matter of increasing the stereotypy 
of behavior without changing mark- 
edly the response sequence that must 
be analyzed. In still other situations 
the goal is to increase the rate of 
production without modifying either 
C or s. Typewriting provides an 
example. Any literate person can 
generate the printed symbols of lan- 
guage if given sufficient time, but the 
expert typist accomplishes the same 
result in a much shorter time. The 
final products are nearly the same, 
and the measure of learning is the rate 
of production of behavior having a 
specified structure. Fourthly, there 
is the adjustment to an altered en- 
vironment. Such adjustment often 
involves the breaking of established 
patterns of behavior and the adoption 
of new patterns that more nearly 
maximize the gratification of needs. 
Such learning needs to be described in 
terms of the specific shifts in the fre- 
quencies of the several courses of 
action. The values of C, s and the 
rate of responding may remain con- 
stant although the particular de- 
pendencies are altered. 

We have too long classified types of 
learning according to the particular 
experimental procedure involved in 
measuring the learning. Whether or
not all learning is reducible to a single
basic process, it seems possible that 
the results of all learning experiments 
can be described in a uniform manner 
and the results of one experiment 
compared with the results of others. 


The ideas discussed in this paper do 
not originate with the authors. Their 
origins can be traced to the literature 
of communication engineering and the 
theory of games. Most important in 
the realm of communication engineer- 
ing has been the influence of Norbert 
Wiener (5), although we have drawn 
more heavily upon the formulation 
given by C. E. Shannon (4). Similar 
ideas have also been used in von Neu- 
mann and Morgenstern’s analysis of 
games (3), and their concept of a 
strategy is directly comparable to the 
present notion of a course of action. 
Our attempt has been to formulate 
certain psychological problems in such 
a way as to take advantage of the 
techniques developed by these mathematicians.


THE PROBLEM OF QUANTIFICATION IN
PSYCHOLOGICAL SCIENCE ¹


BY DANIEL BROWER 


New York University 


Under the influence of the behavior- 
istic movement in American psychology, 
the quantitative approach has become a 
sine qua non for psychological theory, 
research, and practice. If we assume 
that the task of science is to establish 
hypotheses which enable us to describe, 
explain, predict, and control certain 
classes of phenomena, then the question 
before us is: To what extent does quan- 
tification facilitate these aims of psy- 
chological science and to what extent 
does quantification impede the attain- 
ment of these goals? 

Quantification refers to the process
of expressing our observations of phenomena
in terms of numbers or other
discrete figures. It would seem helpful
to clarify at the outset what we mean
by quantification in this sense and in
the broader manner of expressing relationships
which shall be referred to as
“continuous dispersion.” By continuous
dispersion, we refer to an unbroken
chain of amounts. For example, the
traditional mode of expressing intelligence
quotients has implied a difference
between an I.Q. of 120 and an I.Q. of
121 or between an I.Q. of 69 and an
I.Q. of 71. Gradually, however, clinicians
have altered, by empirical necessity,
their mental set toward the I.Q.
and now envisage intellectual potential
as continuously dispersed from “too little”
to “too much” to be registered in
the measuring instrument. Thus, the
basic premise of this paper is that psychological
phenomena are distributed in
unbroken chains of altering amounts;
and that our quantitative methods in
psychology succeed in identifying only
some of the relationships of amount
within a given continuous series. A
suggested inference that may follow is:
Quantitative methods suffer from an
apparently unavoidable fallacy in so
far as the measures or scores employed
represent select samples of the continuous
series of the phenomenon in question.


1 Read before a seminar at The New York
Psychoanalytic Institute, March 7, 1947.

Quantitative methods have found an 
extraordinary degree of application in 
psychology and have been insistently 
demanded on the American scene for a 
number of reasons. First of all, they 
represent a natural accompaniment of 
our mechanical age and the emphasis 
on engineering and physical science. 
Secondly, we have unwittingly attempted
to emulate physics as the
prototype of science without elaborating
the intrinsic differences between
psychology and physics. The meth- 
odology of physics makes possible a 
degree of detachment of subject-matter 
from observer which can, thus far, be 
attained in psychology only by doing 
damage to the phenomenon through 
artificialization. In the history of mod- 
ern physics, astronomy, chemistry, etc., 
the recognition of the “personal equa- 
tion” certainly was a boon to the de- 
velopment of those fields. While the 
facts of individual differences in per- 
ception were derived from psychology, 
physical scientists did not find it neces- 
sary to incorporate psychological meth- 
ods, e.g., introspection, along with their 
factual data. As psychology grew on 
the substrate of natural science, however,
not only were the facts of physics
incorporated into psychology but the 
principal method as well: quantifica- 
tion. 

There are other reasons to account 
for the quantitative emphasis in Ameri- 
can psychology. The widely popular 
and growing use of mental tests with 
the accompanying theory of psychometrics
found a haven in the realm of
numbers. Correlational analysis, va- 
lidity and reliability determinations, 
etc., are cases in point. One of the 
most immovable stones in the whole 
psychometric field is the attempt to 
formulate criterion data in statistical 
terms. The feeling has also developed 
that psychological writings which are 
devoid of correlation tables are synonymous
with speculation. A sense of
power and security through the use of
number has engendered in some quarters
an attitude of apathy or contempt
for non-quantitative studies and writ- 
ings. Thus the structure of psycho- 
logical science has developed to a great 
extent upon the implicit premise that 
objectivity or scientific truth is ob- 
tainable only through measurement. 
The writer would not object to this 
premise if it were associated with the 
co-premise that psychological measure- 
ments must be rectified for variation in 
the measuring organism, and for the 
atomistic fallacy, i.e., contemporary sta- 
tistics generally deal with parts of phe- 
nomena taken out of their natural con- 
texts. It is as though one would study 
oceanic fish life in a saline aquarium! 
Finally, another reason for the recep- 
tivity of American psychology toward 
quantification is to be found in the 
widespread use of “brass instrument” 
and physiological equipment which were 
forged into psychological tools. 

Why are numbers thought to be in- 
trinsically more objective than words? 
One cannot speak in terms of correlation
coefficients or other statistical
units; in the last analysis these must
be reduced to words for interpretive 
and communicative purposes. Through 
the growth of general semantics we 
have been apprised of the limitations 
and essential characteristics of words 
and word structures. This ever-grow- 
ing sophistication has not yet broken 
the aura which surrounds numbers. In- 
stead of functioning merely as tools, as 
intermediaries between verbally formu- 
lated hypotheses and verbally formu- 
lated inferences, these means-toward- 
the-ends of psychological science have 
become ends in themselves. 

We therefore come to the hypothesis 
that quantification need not necessarily 
be effected through numeration, but 
that the language of relativity may be 
at least as useful, if not more so, as 
numeration in our field, e.g., more, less, 
greater than, fewer of, increase or de- 
crease in, etc. may be utilized more 
profitably in formulating psychological 
data. For example, it has been sug- 
gested that the best criterion evolved 
thus far for validating psychoanalytic 
concepts is in terms of increasing or de- 
creasing tensions or forces rather than 
through numerical quantification. 

We now approach the central hy- 
pothesis of this paper. It is suggested 
that numeration per se is essentially 
atomistic and tends to be distortional 
because it leads to over- or under-de- 
termination of meaning. By over-de- 
termination we refer to the tendency to 
over-generalize from limited data. By 
under-determination of meaning we re- 
fer to an excessive narrowing of the 
horizon of application of specific data. 
For example, an I.Q. of 55 on the
Wechsler-Bellevue Adult Intelligence 
Scale might be misinterpreted in the 
direction of under-determination of 
meaning by asserting that this merely 
shows that the individual has an I.Q. of 
55 on the Wechsler test. Or one might 
commit the fallacy of over-determination
by saying that this score proves
that the individual is feebleminded. 
Contemporary psychological statistics 
would seem to be walking a tightrope 
between the serious pitfalls of over-de- 
termination and under-determination as 
two equally unfortunate end-products 
of the reduction of psychological phe- 
nomena to quantitative terms. It is 
suggested that numeration unavoidably 
(in the present state of statistical de- 
velopment) produces a state of over- 
and/or under-determination of meaning 
by means of the atomistic fallacy on the 
one hand (for something is measured 
out of context or as if there were not 
a context), and by inducing a false 
sense of security, derived ontogeneti- 
cally by the use of numbers. By virtue 
of this sense of security one is led into 
inferential thinking based on the overly 
abstracted data of statistics. 

Contemporary statistics, in becoming 
increasingly complicated, removes itself 
more and more from the world of fact 
which it purports to describe. 

As was elucidated by Maslow,² the
tendency has developed in contempo- 
rary psychology “to consider that the 
essence of science lies in its instruments, 
techniques, procedures, apparatus, and 
its methods rather than in its prob- 
lems, questions, functions, or goals.” It 
would appear as if psychological sta- 
tistics is becoming more of an end in 
itself rather than as a means for the 
refined analysis of data which are ob- 
tained in the attempted solution of 
problems. This trend is apparent every 
time a psychologist is asked by his col- 
league, “What are the correlations?” 
Somehow the major unsolved issues of 
psychological science are frequently lost 
in a quasi-chaotic maze of numbers. 
This preoccupation with techniques
rather than problems, per se, leads to
a number of fallacies which Maslow
summarizes as follows:


1. An overemphasis on polish and
elegance rather than creativeness
and vitality in research.

2. Giving the leading positions in
psychology to technicians rather
than discoverers.

3. An over-valuation of quantification
for its own sake regardless of
the meaningfulness of the data.

4. The forcing of problems into the
molds of techniques rather than
elaborating techniques for the
solution of problems.

5. Intensification of a false hierarchical
system among the sciences.

6. The neglect of the problems of
values with a consequent blurring
of the criteria for judging the
worth or importance of a piece of
work.


2 A. H. Maslow, Problem-centering versus
means-centering in science, Phil. Sci., 1946, 13,
326-331.


Returning to the concept of over- or 
under-determination of meaning, as a 
distortional resultant of the application 
of statistics to psychology, it would ap- 
pear that this hypothesis is related to 
the more crucial question of how one 
goes about verifying hypotheses in psy- 
chology. The boiling cauldron of opin- 
ionated clashes between behaviorists, 
psychoanalysts, and Gestaltists boils 
down to one essential core of disagree- 
ment: What are the criteria of scientific 
truth in psychology, i.e., what are the 
rules for verifying psychological hy- 
potheses? These three major schools 
of psychology differ fundamentally on 
the advisability of applying quantita- 
tive methods toward the solution of 
certain problems and over the limita- 
tions and frontiers of psychological sta- 
tistics. For example, many academic 
psychologists condemn certain areas and 
approaches in clinical psychology and 
psychoanalysis on the grounds that 
these fields do not subject their material
to rigorous statistical checks. It
would seem to be more fruitful to ask
whether the problems in clinical psy- 
chology lend themselves to solutions 
through the use of techniques devised 
in academic psychology. Much futile 
controversy proceeds along the lines of 
argument over methods, instead of a 
refined analysis of the problems them- 
selves, and the forging of whatever 
techniques are necessary for specific 
problems. Psychology, like any other 
science, cannot hope to progress ap- 
preciably through the continued reap- 
plication of old techniques to newer 
problems and by constantly working on 
the assumption that a quantitatively 
exact method necessarily yields a valid 
answer to any problem. 

In the last analysis all quantitative 
material must be converted into lin- 
guistic forms. The more complicated 
the statistical method, the more it re- 
moves itself from the original raw data 
and the more speculation is involved in 
retracing the steps from the final sta- 
tistical expressions to the meaningful- 
ness in the original data. It must be 
granted that quantification facilitates 
regularity, systematization, and a cer- 
tain kind of objectivity, but the careful 
use of numbers sharpens and objectifies 
our verbal formulations themselves to 
semantic analysis. In the chemical 
analogy, refined microanalysis is mean- 
ingless in a contaminated flask. We 
have passed through an era in which 
many statistical refinements have been
elaborated but little progress in seman- 
tic insights has been made. Neverthe- 
less statistical techniques are practi- 
cally idolized, while semantics is cate- 
gorized by most psychologists along 
with palmistry and phrenology. The 
broader scope of semantics, as devel- 
oped thus far by Carnap, Korzybski, 
and others, includes basic scientific 
epistemology, the dynamics and social 
implications of scientific problems, and 
a vigorous onslaught in the field of
verbal exactitude in the formulation of
scientific hypotheses. 

An undue preoccupation with quan- 
tification may also represent an escape 
from the central arena of psychological 
issues through an unstructured eclec- 
ticism and an apparent lack of em- 
phasis upon deep creative thinking. 

Many contemporary psychologists 
have developed a habit-pattern of view- 
ing phenomena only in terms of their 
measurable units, to wit, Thorndike’s 
famous dictum: Everything that exists 
exists in some amount. Thus, for 
Thorndike, every psychological phe- 
nomenon is ultimately measurable. One 
could reasonably take issue with this 
position on its a priori and sweeping
grounds. However, even taking this 
formulation for what it is, the implicit 
essence of this dictum is the static, 
atomistic character of behaviorism. The 
writer would rather suggest that every- 
thing that exists exists in some amount 
with relation to everything else that 
exists, and that existences and amounts 
are relative for time-space, context, and 
extenuating circumstances. Thus, to 
express a psychological variable in 
quantitative terms requires a non- 
quantitative orientation of the meas- 
ured variable to the entire context from 
which it was extracted; this orientation 
is achieved through verbal description 
and a verbal placement of the measured 
variable in time and space. If, there- 
fore, quantitative data are to have any 
meaning in a pure or applied sense, 
they must be anchored to the original 
context and qualified as to limitations 
of meaning; this is attained through the 
use of words and not numbers. Words 
can thereby prevent numbers from 
yielding over- or under-determination 
of meaning. 

From the elementalism of the ancient 
Greeks through the monads of Leibnitz 
to the stimulus-response units of Watson,
a continuous thread of atomism
has been prevalent. Contemporary dynamic
psychology, representing a fusion
of the two major tributaries of Gestalt 
psychology and psychoanalysis, rests 
largely upon the protest against the 
strongly ingrained atomistic pattern of 
psychological thought. Statistics lends 
itself with tremendous facility to the 
atomistic approach. But how easy it is 
to fall into the traps of over- or under- 
determination! For example, a simple 
measure of central tendency yields over- 
determination of meaning by creating 
the illusion that a particular median 
describes the entire population whence 
it was derived. On the other hand, the 
median is surely something more than 
a mid-score in a series arranged in the 
order of magnitude. In a sense, a 
median does describe a population, but 
only in terms of the range, standard 
deviation, sampling factors, size of the 
population, etc. Modern statistics certainly
teaches that a median has little
meaning if it is not considered in relation
to the general characteristics of
the population whence it was obtained.
These characteristics may be referred
to as contextual referents. Furthermore,
these contextual referents must
be contrasted with those of other populations
if one is to meaningfully compare
medians.
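
The point about contextual referents may be illustrated with a minimal sketch in Python. The two samples are invented for illustration and the function name is hypothetical; the samples share a median but differ in range, standard deviation, and size, which is the information a bare median conceals.

```python
import statistics


def describe(scores):
    """Return the median together with its contextual referents."""
    return {
        "median": statistics.median(scores),
        "n": len(scores),
        "range": max(scores) - min(scores),
        "sd": statistics.stdev(scores),  # sample standard deviation
    }


# Hypothetical samples, invented for illustration only.
narrow = [48, 49, 50, 51, 52]
wide = [10, 30, 50, 50, 50, 70, 90]

summary_narrow = describe(narrow)
summary_wide = describe(wide)
print(summary_narrow)
print(summary_wide)
```

Reporting the whole dictionary rather than the median alone is one simple way of keeping a statistic attached to the population from which it came.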

While many psychologists have taken 
these points into account in using and 
examining measures of central tendency 
and variability, they have, in the 
writer’s opinion, failed thus far to ap- 
ply such methodological refinements to 
the fields of correlation and factor 
analysis. 

Linear correlations have become a 
sine qua non in the research of aca- 
demic psychology. It is very difficult 
to resist the temptation of subjecting 
two mysterious variables to a correla- 
tional procedure and harvesting a more 
mysterious coefficient of correlation! 
Now, not only is the question of relationship
inadequately defined, but the 
correlation coefficient, in order to be 
rendered meaningful, must be qualified 
in so many ways as to make the pro- 
cedure laborious and complicated. For 
example, the normality of the distribu- 
tion of the scores of each variable is 
taken for granted; one should take 
skewness and kurtosis into account. 
Furthermore, the degree of commun- 
ality between the two variables is rep- 
resented by the square of the correla- 
tional coefficient rather than by the 
coefficient itself. It is also necessary to 
consider the essentially atomistic char- 
acter of linear correlation, for one is 
correlating a unit with another unit, a 
factor with another factor, with no 
contextual referents. One has yet to 
evolve a method for correlating one 
Gestalt with another Gestalt, or for incorporating
time-space referents into a 
correlational coefficient. Of course, the 
saddest blind spot of all is the construc- 
tion and supposed verification of psy- 
chological hypotheses through the use 
of correlational analysis ignoring the 
fact that correlations give measures of 
concomitance and not causality. Thus 
the coincidental fallacy, a relic of primi- 
tive and common-sense thinking, is fos- 
tered by correlational methods which 
are all right in themselves but have 
been misused and overused in our field 
more than in any other discipline. It 
is also clear that much more must be 
done in the field of multiple correlation 
and regression equations and that psy- 
chologists become less dependent upon 
linear correlation in proportion as they 
recognize that this procedure is a natu- 
ral accompaniment of atomism and re- 
ductionism and that correlational meth- 
ods were developed in the fields of 
agronomy and population statistics. 
This exemplifies a point emphasized 
above that the difficulty with psycho- 
logical statistics has been the presag- 
ing of hypothesis formation by available
methods. In other words, since 
correlational methods were carried into 
the field of psychology, the modern 
psychologist is faced with two alterna- 
tives in this regard: One is to go 
through the qualifying procedures in 
the use of correlation coefficients, as 
suggested above; this becomes more 
laborious than a careful detailed verbal 
delineation. The second alternative is 
to indulge the blind use of correlational 
methods and, thus, satisfy one’s inner 
needs for being “scientific.” The psy- 
chological journals are saturated with 
an abundance of papers which follow 
the second alternative. The discerning 
reader is continuously asking himself 
the inevitable “so what?” after reading 
much of this material. For, after all, 
if a paper starts out with a problem 
posited in verbal terms and winds up 
with a summary of quantitative results, 
the job is only half done and the reader 
remains in mid-air. On the other hand, 
the careful, configurational and dynamic 
interpretation of the statistical results 
is extremely difficult in the light of the 
intrinsic limitations of psychological statistics
thus far. Would it not, there- 
fore, seem more plausible and necessary 
for us to elaborate rules and techniques 
for global interpretation of statistical 
results rather than attempting to refine 
the statistical methods, per se? 
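
The qualifications listed above (shape of each distribution, and the square of the coefficient as the measure of communality) can be gathered into one report. The following is a minimal sketch in Python under stated assumptions: the scores are invented, the function names are hypothetical, and skewness and kurtosis are computed as the third and fourth standardized moments.

```python
from statistics import fmean, pstdev


def pearson_r(x, y):
    """Product-moment correlation of two equal-length score lists."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def standardized_moment(values, power):
    """Mean of z-scores raised to `power` (3: skewness, 4: kurtosis)."""
    m, s = fmean(values), pstdev(values)
    return fmean([((v - m) / s) ** power for v in values])


# Hypothetical scores, invented for illustration only.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2, 1, 4, 3, 6, 5, 8, 7]

r = pearson_r(x, y)
report = {
    "r": r,
    "r_squared": r ** 2,  # proportion of shared variance
    "skewness_x": standardized_moment(x, 3),
    "kurtosis_x": standardized_moment(x, 4),  # 3.0 for a normal curve
    "n": len(x),
}
print(report)
```

Even with a coefficient above .90, the squared value is noticeably smaller, and none of these figures says anything about causality.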

A frontal and unashamed self-exami- 
nation of psychology would, it is sug- 
gested, reveal an increasing trend to- 
ward the substitution of the easy and 
automatic statistical methods for deep 
creative insights and formulations. If 
creative inspiration is being plowed un- 
der by a rigid insistence upon numera- 
tion, then there is little cause for sur- 
prise at the paucity of fundamental psy- 
chological discoveries. The house of 
contemporary psychology is a veritable 
confusion of development and refine- 
ment of methods—but methods for 
what problems? 

The epics of science, the discoveries 
of lasting and stimulating value, have 
carried in their wake the elaboration of 
new methods. Few significant discov- 
eries have followed the persistent reap- 
plication of old methods which were ap- 
plicable to older, now solved, problems. 
The discovery of the tubercle bacillus 
opened a large chapter in bacterial dif- 
ferential staining. The discovery of 
X-rays preceded the application of 
this remarkable phenomenon to medical 
diagnosis. Roentgen did not apply 
traditional methods to the discovery of 
the X-ray, nor did he believe that he 
would benefit clinical medicine. His 
discovery resulted from the simultane- 
ous occurrence of a phenomenon not ex- 
plicable in terms of established theory, 
together with his keenness of percep- 
tion and reasoning. Roentgen had the 
flexibility of mind to formulate new 
hypotheses for old data and to pick up 
the small thread of a chance observation 
and carry on from that point. While 
we certainly know very little about 
the psychology and psychopathology 
of creative thinking, one thing seems 
clear: Flexibility is the life blood of 
any science, especially one so encom- 
passing as psychology; and that a 
method is valuable only to the extent 
that it will facilitate the elaboration of 
hypotheses which cannot be adequately 
approached by that method. There- 
fore the usefulness of a scientific method 
inheres in the explicitness of the limita- 
tions of its usefulness. In psychology 
the use of statistics could become more 
meaningful should psychologists make 
its limitations more explicit. 

It might be helpful to illustrate the 
presaging of research methods by care- 
fully refined formulations of problems 
and hypotheses from the field of psy- 
chological science itself. The persistent
efforts of the Würzburg school, 
headed by Mach, to discover some- 
thing about the psychology of thinking 
through the repeated use of the method 
of controlled introspection yielded little 
tangible result; the method of hypnosis 
as used in the psychiatric clinic in Paris 
proved inadequate to account for and 
treat certain kinds of neurotic disor- 
ders. Freud, essentially a pragmatist, 
formulated certain hypotheses out of 
which the psychoanalytic method grew 
and out of which, in turn, an elabora- 
tion of psychoanalytic theory ensued. 
Thus it is seen that vigorous advances 
in psychology, as in other sciences, fol- 
low upon a reversible interaction of 
methodological refinement and formula- 
tion of hypotheses. The attempt to 
force a new hypothesis out of an old 
method is essentially inductive; the ef- 
fort to formulate a new hypothesis 
purely by inspiration and then draw 
whatever strands of new methods one 
can out of this hypothesis is essentially 
deductive. All will agree with modern 
logicians that science is inductive-de- 
ductive, and to preoccupy one’s self 
with either one of these two components 
of scientific reasoning is rigid and mis- 
leading. Therefore quantification in 
psychology helps, to a certain extent, 
to lay the ground-work for new formu- 
lations in a given area, provided that 
area lends itself to measurement in our 
present stage of development. This 
tendency has, it is suggested, been over- 
developed on our contemporary scene 
with a derogation and lack of emphasis 
upon clearly formulated and meaning- 
ful problems, the semantic refinement of 
their manner of expression, and a sub- 
sequent delineation of the role of sta- 
tistics in the attempted solution of the 
particular problem. 

Lest the reader deduce that a polari- 
zation of quantitative versus non-quan- 
titative methods is posited here, it 
should be stressed that among the fasci- 
nating revelations of scientific history 
one finds the greatest progress and most 
rapid rate of growth in borderland fields 
such as physical chemistry, biochemistry,
psychosomatic medicine, etc. That 
is to say, two possible borderlands in 
psychological science which remain to 
be explored are: 


1. The dynamic inter-connections be- 
tween the various quantitative and non- 
quantitative methods with an elaboration 
of the mutual dependencies and limitations 
of each. 

2. The study of the relationships be- 
tween numerical and semantic analysis to- 
ward the end that the present dichotomy 
between measurement and description be 
permanently relegated to the past of Aris- 
totelian logic. This would go a long way 
toward the reduction of tension and hos- 
tility between academic psychology, based 
largely upon numeration, and clinical psychology,
accustomed essentially to verbal 
characterizations. 


This brings to the fore the essential 
basis for clash between academic and 
clinical psychologists, since the primary 
issue at stake is quantification. The 
academic psychologist has restricted the 
scope of his investigations in order to 
fit them into available statistical meth- 
ods. This is only half of the meth- 
odological fallacy involved, for the 
academic research worker takes too 
much for granted in the utilization of 
present day statistics. A fusion should 
be attempted in the form of discovering 
the area of communality between the 
strengths of quantification and the 
weaknesses of verbalization, and vice 
versa. In this manner, these two great 
tools of the human mind, word and 
number, would buttress and facilitate 
each other at their mutually weak 
points rather than thwart the development
of psychology by mutual inhibition.
Since both word and number are 
symbolic representations of inner and 
outer reality and constitute our chief 
means of social communication, it 
might be well for research workers to 
study the methods of hypothesis formation
and hypothesis verification in 
psychology as a primary pursuit. The 
relative emphases and limitations of 
words and numbers would then grow 
naturally out of the rules derivable 
therefrom. Only in this way can psy- 
chological research gain a certain vi- 
tality and courage which it lacks as 
long as it continues to build its hy- 
potheses on the structure of old meth- 
ods. 

Not only does the use of statistics 
tend to produce over- or under-deter- 
mination of meaning, but also over or 
under self-confidence of the psycholo- 
gist. On the one hand, the psycholo- 
gist does not trust himself to resort to 
sufficient use of imagination and intui- 
tion. On the other hand, quite para- 
doxically, he appears to trust himself 
too much when he fails to realize that 
the use of numbers does not preclude 
the operation of bias and the selective 
nature of human attention and percep- 
tion, let alone the powerful influence of 
unconscious motivation. Nor does the 
use of numbers necessarily imply ob- 
jectivity and authenticity; a cloud of 
statistics may very well befog real 
issues. 

Those who tend almost to idolize 
quantitative methods function in their 
work as if the following were synonyms: 
measured, objective, scientific, empiri- 
cal, experimental; and as though the 
following, likewise, were identities: 
non-quantitative, speculative, Freudian, 
mystical, clinical, unscientific. Both 
academic and clinical psychologists hurl 
the epithet at each other of either be- 
ing unscientific through failure to ade- 
quately use numbers or for being un- 
scientific for failure to work with mean- 
ingful problems. It is surely time for 
a fusion of emphases as between quan- 
tification for its own sake and the ne- 
glect of the positive virtues of statistics. 
This fusion can be partly attained, it is 
suggested, through the use of ordinal 
scales and semantics. 

In the field of psychometrics and 
aptitude testing it is often forgotten 
that a score of zero on a test does not 
necessarily mean zero ability or apti- 
tude; a score is not a fact. The score 
has been abstracted from other possible 
measures of the same ability; it is far 
from perfect in validity and reliability, 
etc. These limitations of test scores 
and their de-contextual character have 
been worked out by psychometricians 
and theorists. But the implication is 
that the gulf between data and facts is 
ever-widened by each further operation 
of a statistical nature. Hence, it would 
seem desirable to maintain simpler sta- 
tistical procedures. Measures of cen- 
tral tendency and variability and meas- 
ures of the significance of the difference 
in means, etc., remove themselves only 
slightly from the original data and the 
possible interpretations of these data. 
On the other hand, correlational and 
factorial analyses depart very far in- 
deed from the original data and make 
many more assumptions which are not 
entered explicitly into the working sys- 
tem. One test which might be applied 
for the safety of statistical method is 
the relative ease with which it can be 
explained to the intelligently unso- 
phisticated. 

All statistics thus far have been de- 
veloped in the direction of inter-indi- 
vidual differences. In conjunction with 
such clinical concepts as “normality” 
and “self-consistency,” it may be inter- 
esting to develop intra-individual sta- 
tistics. However, this does not imply 
the carry-over of inter-individual meth- 
ods to the problems of intra-individual 
variation but rather the elaboration of 
new methods. To be able to measure 
the amount of erratic tendency in an 
individual or the variation in his in- 
stinctual strivings would find valuable 
application. Perhaps inverted factor 
analysis will supply the answer. It 
must, however, be based upon some 
formulation of personality theory which 
has a clinical anchorage. 

In conclusion, quantification facili- 
tates the growth of psychology to the 
extent that it is subsumed under the 
broader realm of qualitative description 
and explanation. Statistical methods 
inhibit vigorous growth in psychology 
to the extent that they promote atom- 
istic, categorical thinking, and over- or 
under-determination of meaning. Quantitative
emphases are further damaging 
in that they imply that qualitative 
methods are not subject to the same 
rules of verification and authenticity, 
and are necessarily less reliable than 
quantitative methods. 

The expansion of the frontiers of 
psychological science will surely be fa- 
cilitated by some fusion of qualitative 
and quantitative approaches through a 
new, yet-to-be-devised, system of se- 
mantic-statistical analysis. 


[MS. received May 16, 1949] 
I 


The methods based upon Thurstone’s 
Law of Comparative Judgment (3, 4) 
seem the most rational yet devised for 
ordering a group of stimuli or concepts 
on a psychological continuum when 
the corresponding physical continuum 
is unknown, unmeasurable, or otherwise 
lacking. These methods suffer from 
certain experimental handicaps, how- 
ever, which seriously limit their prac- 
tical usefulness. Thus, if all possible 
paired comparisons are made, the experimental
session must include n(n − 1)/2 
judgments: even with a fairly small 
number of stimuli (e.g., 15) the task is 
a tiring one, and with a large n (e.g., 
50) it becomes virtually impossible. 
Considerable economy is achieved by 
requiring subjects merely to rank-order 
the stimuli, and deriving the necessary 
paired comparisons from the data thus 
obtained, but this method (besides in- 
volving an extremely laborious tabulat- 
ing procedure) usually precludes the 
possibility of obtaining judgments from 
subjects in groups, since each subject 
must be given the experimental mate- 
rials to manipulate. 
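
The burden of complete paired comparison is easy to check: the number of distinct pairs among n stimuli is n(n − 1)/2. A minimal sketch in Python (the function name is hypothetical) gives the counts for the two sizes mentioned above.

```python
def paired_comparison_count(n):
    """Judgments needed to compare every pair of n stimuli once."""
    return n * (n - 1) // 2


for n in (15, 50):
    print(n, "stimuli:", paired_comparison_count(n), "judgments")
# 15 stimuli: 105 judgments
# 50 stimuli: 1225 judgments
```

A rating scale, by comparison, asks for as few as n judgments from each subject.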

1 The author wishes to thank Dr. Quinn 
McNemar and Dr. Donald W. Taylor, both of 
Stanford University, and Dr. Harry Helson, 
of Brooklyn College, for reading the initial 
draft of this article. 

Rating-scale methods, in contrast, 
represent an ideal of experimental convenience:
the minimum number of judgments
involved is only one for each 
stimulus, and the stimuli may usually 
be displayed to large groups of subjects 
simultaneously. Unfortunately, however,
the rating-scale yields seriously 
distorted results when scored according 
to the obvious assumption that the 
scale intervals are psychologically equal. 
Hevner (2), on comparing empirically 
the equal-appearing intervals method 
with the Thurstone methods, found a 
markedly curvilinear relationship be- 
tween the results: the intervals near 
the end of the scale were found to be 
“larger” than those in the middle. The 
source of this distortion is fairly evi- 
dent on a priori grounds. The ratings 
of a given item by a number of judges 
will, when unrestricted, tend to dis- 
tribute themselves normally about the 
hypothetically “true” rating; but when 
the “true” rating falls at an extreme, 
variability is possible in only one direc- 
tion. The distribution obtained in the 
latter case will therefore be skewed 
away from the end of the scale, yielding 
a mean (or median) too close to the 
center. 
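
The end-distortion can be reproduced numerically. The sketch below, in Python, rests on an assumption that is ours and not stated in the text: judgments are taken to be normally distributed about the "true" rating, and any judgment that would fall beyond the scale is recorded in the end category. The function name and the standard deviation of 1.5 categories are illustrative only.

```python
from statistics import NormalDist


def expected_rating(true_value, sd, k):
    """Mean recorded rating on a 1..k scale under the censoring assumption."""
    dist = NormalDist(true_value, sd)
    total = 0.0
    for category in range(1, k + 1):
        # End categories absorb everything beyond the scale.
        lower = dist.cdf(category - 0.5) if category > 1 else 0.0
        upper = dist.cdf(category + 0.5) if category < k else 1.0
        total += category * (upper - lower)
    return total


middle = expected_rating(6, 1.5, 11)
top = expected_rating(11, 1.5, 11)
bottom = expected_rating(1, 1.5, 11)
print(middle, top, bottom)
```

Under these assumptions a stimulus whose true rating lies at the middle of an eleven-point scale keeps its mean, while stimuli at either extreme are pulled toward the center.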

The method about to be described 
preserves the experimental simplicity of 
the rating-scale technique, but treats 
the ratings in a manner which ration- 
ally avoids the end-distortion just men- 
tioned. 


II 


This method may be considered an 
extension of the Method of Single 
Stimuli (5, 6). In the latter the sub- 
ject is required to classify each of a 
number of stimuli (e.g. lifted weights) 
into one of two opposite categories 
(e.g. “Heavy” and “Light”), an inter- 
mediate judgment being also permitted 
in some cases. After becoming familiar 
with the series, the subject adopts a 
fairly stable subjective standard, or 
adaptation-level (1), so that the fre- 
quencies with which successive stimuli 
are assigned to a given category describe
a normal ogive.2 Knowing the 
proportion of times which a subject (or 
group) assigns a given stimulus to each 
of the categories, we may consider that 
the psychological value of that stimulus 
is given by the deviate of the normal 
curve corresponding to that proportion. 
If, for example, four different weights 
have been judged “Heavy” 10, 50, 60, 
and 80 per cent of the time, respectively,
we find the corresponding deviates
−1.28, zero, +.25, and +.84, 
and assume that these figures indicate 
the spacing of the stimuli along the 
psychological continuum, as illustrated 
in Fig. 1. 

[Fig. 1. Axes: % judgments “Heavy”; psychological continuum.]

2 We are here making certain assumptions,
essentially those of Thurstone’s Case V (4),
which we shall presently examine in more detail.

Let us now consider a situation in 
which three instead of two categories 
of judgment are permitted: these might 
be “Light,” “Medium Heavy,” and 
“Heavy” (for illustrative purposes we 
choose a middle category which is not 
quite neutral in its connotation). We 
may dichotomize the ratings thus obtained
for a given weight about either 
of two points: the “Light”–“Medium 
Heavy” division or the “Medium 
Heavy”–“Heavy” division; that is to 
say, we may combine the “Medium 
Heavy” judgments with either the 
“Heavies” or the “Lights,” and thence 
proceed as in the two-category case. 
Ideally these two ways of dichotomizing
should give us precisely equivalent 
results in terms of the obtained separations
of stimuli on the psychological 
continuum. Thus the four weights considered
above might be ordered as in 
Fig. 2. In this illustration the psychological
values (X1 and X2) obtained 
from the two ways of dichotomizing 
differ by 1.25 σ units. This constant 
represents the psychological width of 
the “Medium Heavy” category. In a 
practical situation, where the correspondence
is not so precise, the mean 
difference between X1 and X2 would be 
taken as the width of the intervening 
interval. 
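
The conversion from percentages to normal deviates, and the way a constant category width appears between two dichotomies, may be sketched in Python. The first part reproduces the four weights of the text. The second part is hypothetical: the percentages for the lower cut are constructed by us so that the middle category is exactly 1.25 σ wide, and the names x_upper_cut and x_lower_cut are ours.

```python
from statistics import NormalDist

UNIT_NORMAL = NormalDist()  # mean 0, standard deviation 1


def deviate(percent):
    """Normal deviate below which `percent` of the curve lies."""
    return UNIT_NORMAL.inv_cdf(percent / 100.0)


# Two-category case: per cent of "Heavy" judgments for four weights.
heavy_percent = [10, 50, 60, 80]
x_values = [round(deviate(p), 2) for p in heavy_percent]
print(x_values)  # [-1.28, 0.0, 0.25, 0.84]

# Three-category case (constructed example): percentages above the lower
# cut are chosen so that the middle category is 1.25 sigma wide.
width = 1.25
x_upper_cut = [deviate(p) for p in heavy_percent]
percent_above_lower_cut = [100 * UNIT_NORMAL.cdf(x + width) for x in x_upper_cut]
x_lower_cut = [deviate(p) for p in percent_above_lower_cut]
differences = [low - up for up, low in zip(x_upper_cut, x_lower_cut)]
print([round(d, 2) for d in differences])
```

With real ratings the four differences would not be identical, and their mean would serve as the estimate of the category width.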


[Fig. 2]
equally well to a rating scale of more 
than three categories. If we use a 
scale with k categories, there are k − 1 
ways in which we may dichotomize the 
obtained data. Since the ratings of a 
given stimulus will usually not cover 
the entire scale, the relative psycho- 
logical position of that stimulus will 
be indicated only by those dichotomies 
which actually divide its ratings, the 
X-values corresponding to 0 per cent 
and 100 per cent being indeterminate. 
The size of the interval separating ad- 
jacent dichotomies may be inferred 
from the X-values of stimuli common 
to both, as suggested above; thus, by 
the addition of suitable constants, X- 
values from all the dichotomies may 
be rendered comparable, and fitted to- 
gether on a single psychological con- 
tinuum. We shall refer to such a treatment
of ratings as the Method of 
Graded Dichotomies. 


Application of the method involves 
the following computational steps: 

1. A table is prepared, the rows of 
which represent stimuli and the columns 
rating-scale categories, showing for each 
stimulus the percentage of ratings fall- 
ing within each category. 

2. From this a second table is made, 
with a column for each place at which 
the ratings are to be dichotomized, 
showing for each stimulus the percent- 
age of ratings falling above each cut- 
ting-point. Most complete use of the 
data is presumably made by using all 
possible (k − 1) dichotomies, but satisfactory
results may be obtained with 
some omissions if k is large and ratings 
are broadly dispersed. 

3. By the use of a table of the nor- 
mal curve (or, preferably, of the nor- 
mal ogive) the above percentages are 
translated into x/σ units. These X-values,
as we shall call them, are designated
as positive when the percentage 
above the cutting-point is greater than 
50 per cent, and negative when it is less 
than 50 per cent. Extreme percentages 
(usually those less than 5 per cent or 
greater than 95 per cent) are discarded. 

4. For each pair of adjacent dichoto- 
mies, the mean difference between the 
X-values of those stimuli represented 
in both is calculated. This is taken as 
the magnitude of the interval separat- 
ing the two cuts. 

5. To the X-values of each dichot- 
omy is added the sum of the intervals 
below that dichotomy. An additional 
constant may be added to all the X- 
values in order to make the smallest a 
positive number. The purpose of this 
operation is to refer all the X-values to 
a common (arbitrary) origin, so that 
they will be comparable with one an- 
other. 

(The figures obtained in steps 2, 3, 
and 5 may be recorded in the same 
table if sufficient space is allowed in 
each cell.) 

6. Each stimulus now has one or 
more (equated) X-values, the mean of 
which is taken as the best estimate of 
its relative position on the psychological 
continuum. In theory the X-values for 
a given stimulus should be identical: if 
they differ more than considerations of 
experimental error would lead us to 
anticipate, we may suspect that certain 
assumptions of our method are incor- 
rect. This contingency will be consid- 
ered later. 
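
The six steps above can be sketched in Python. This is a minimal illustration under stated assumptions, not a definitive implementation: every dichotomy is used, the 5 and 95 per cent limits are taken as the discard rule, the optional constant of step 5 is omitted, and all names are ours. The test data are synthetic, generated from known positions so that the recovered spacing can be checked.

```python
from statistics import NormalDist, fmean

UNIT_NORMAL = NormalDist()


def graded_dichotomies(percent_table, low=5.0, high=95.0):
    """Scale stimuli from rating percentages (steps 1-6 of the text).

    percent_table maps each stimulus to its percentage of ratings in each
    category, ordered from lowest to highest category (step 1).
    """
    k = len(next(iter(percent_table.values())))

    # Steps 2-3: percentage above each cut, converted to a normal deviate.
    # Cut c lies between categories c and c + 1; extremes are discarded.
    x = {}
    for stimulus, row in percent_table.items():
        x[stimulus] = {}
        for c in range(1, k):
            above = sum(row[c:])
            if low <= above <= high:
                x[stimulus][c] = UNIT_NORMAL.inv_cdf(above / 100.0)

    # Step 4: interval between adjacent cuts is the mean difference of the
    # X-values of stimuli represented in both. Step 5: running sum of the
    # intervals below each cut.
    offset = {1: 0.0}
    for c in range(1, k - 1):
        diffs = [v[c] - v[c + 1] for v in x.values() if c in v and c + 1 in v]
        if not diffs:
            raise ValueError(f"no stimulus is divided by both cuts {c} and {c + 1}")
        offset[c + 1] = offset[c] + fmean(diffs)

    # Step 6: mean of the equated X-values of each stimulus.
    return {
        stimulus: fmean([value + offset[c] for c, value in v.items()])
        for stimulus, v in x.items() if v
    }


def synthetic_table(true_values, cut_points):
    """Hypothetical data: category percentages implied by known positions."""
    table = {}
    for name, t in true_values.items():
        above = [100.0] + [100 * UNIT_NORMAL.cdf(t - c) for c in cut_points] + [0.0]
        table[name] = [above[i] - above[i + 1] for i in range(len(above) - 1)]
    return table


true_values = {"A": -0.5, "B": 0.0, "C": 0.5, "D": 1.0}
table = synthetic_table(true_values, cut_points=[-0.5, 0.3, 1.2])
scale = graded_dichotomies(table)
print({name: round(value, 2) for name, value in scale.items()})
```

Because the origin is arbitrary, only the spacing of the recovered values is meaningful; in this constructed case it matches the spacing of the positions used to generate the table.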


IV 


The ratings obtained by Hevner (2) 
from 370 subjects on the goodness of 
20 specimens of handwriting have been 
scaled by the Graded Dichotomies 
Method in the manner just described. 
Hevner’s data constitute excellent ma- 
terial for the empirical validation of 
this method, since she also applied two 
different methods based upon Thurstone’s
Law of Comparative Judgment 
(Paired Comparisons and Order of 
Merit) to the same specimens, using 
the same subjects. Her rating scale 
contained eleven categories, making 
possible ten dichotomies, but the dis- 
persion of the ratings was such that 
only six of the ten were used. Cuts 
were made between the following cate- 
gories: 2 & 3, 4 & 5, 6 & 7, 8 & 9, 9 & 
10, 10 & 11. 

Table I shows the value obtained for 
each specimen by the present method 
and by the three methods which Hevner 
employed. In Figs. 3, 4, and 5 the 
values obtained by the Graded Dichoto- 
mies procedure are plotted against those 
from each of the earlier methods. The 


TABLE I 


HANDWRITING SCALE VALUES OBTAINED 
BY FOUR METHODS 


Method: Paired Comparison; Order of Merit; Equal-appearing intervals; Graded Dichotomies 


.00 1.60 
44 2.15 
2.80 
40 ‘ 2.25 
21 ‘ 1.80 
49 80. 2.35 
1.66 5.13 
2.17 6.23 
2.05 6.10 
3.18 8.10 
2.87 7.70 
3.15 8.10 
3.51 8.70 
3.81 9.30 
4.75 9.95 
4.71 9.90 
4.79 10.15 
5.60 10.40 
5.92 10.50 


7.43 
7.69 


The above data, with the exception of the 
Graded Dichotomies values, are reproduced 
from Hevner (2). 
agreement between the present method 
and the two Thurstone methods (Figs. 
3 and 4) is quite satisfactory, being 
approximately as good as that of the 
Thurstone methods with each other. 
Whereas these two plots show little sug- 
gestion of non-linearity, the relation- 
ship between the Graded Dichotomies 
values and the Equal-Appearing In- 
tervals values (Fig. 5) is conspicuously 
curvilinear, as we should expect from 
Hevner’s findings. It should be pointed 
out that the latter two sets of results 
are based upon precisely the same raw 
data, so that discrepancies are entirely 
a function of differences in treatment. 


V 


Some consideration of the assump- 
tions involved in the Method of Graded 
Dichotomies, and of its relationship to 
the Comparative Judgment methods of 


Thurstone, seems desirable at this point. 
In the first place it may be emphasized 
that the method does not assume sub- 
jective equality of rating-scale intervals, 
but instead infers the relative magni- 
tudes of these intervals from the rating 
behavior of the subjects. Neither does 
it assume that the stimuli are distrib- 
uted normally, or in any other specific 
way, with respect to the attribute in 
question. It does assume (1) that rat- 
ings of a given stimulus will, in so far 
as possible under the restrictions of the 
scale, be distributed normally about a 
hypothetically “true” rating, and (2) 
that the distributions of ratings for the 
various stimuli will have identical stand- 
ard deviations except in so far as the 
distributions are subject to curtailment, 
this constant standard deviation being 
the unit in which ratings are finally 
scaled. 

The latter assumption is in principle 
the same as that of Thurstone’s Case V 
(4), which demands that the stimuli 
compared have equal “discriminal dis- 
persions,” or ambiguity. (The other 
requirement of Case V is concerned 
with the correlation of judgments, and 
has no apparent application here.) It 
should be noted that Hevner (2) as- 
sumed the conditions of Case V in ap- 
plying both the Paired Comparison 
Method and the Order of Merit Method. 

If we are led to believe, either from 
inspection of the original ratings or 
from unexpectedly large discrepancies 
among the X-values to be averaged in 
the final step, that the dispersions of 
ratings for the various stimuli are genu- 
inely unequal, some adjustment for this 
condition is clearly necessary. The de- 
sired correction may be approximated 
by the following operations: (1) After 
completion of step 4 above, for each 
stimulus the difference between the extreme
unequated X-values (i.e., the difference
between the X-values from the 
highest and lowest dichotomies in which 
the stimulus is represented) is obtained. 
The sum of the intervals (computed 
in step 4) separating the dichotomies 
which provide these X-values is divided 
by this difference. The quotient obtained
represents a ratio between the 
dispersion of a particular stimulus 
and the mean dispersion of comparable 
stimuli (it is necessary to bear in mind 
that the relative magnitudes of the X-values
are inversely related to the dispersion
of the stimulus). (2) This 
quotient constitutes a correction factor, 
by which each X-value of the stimulus 
under consideration is multiplied. By 
the present operation the dispersions of 
the various stimuli are equated. Steps 
5 and 6 may now be followed using the 
corrected X-values. 
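
The correction just described may be sketched in Python. The function name and the numerical example are ours: the stimulus is hypothetical, with a dispersion twice that of the comparable stimuli, so that its X-values are spaced only half as widely as the intervals obtained in step 4.

```python
def dispersion_correction(x_by_cut, intervals):
    """Correction factor for one stimulus, or None if it cannot be estimated.

    x_by_cut:  {cut number: unequated X-value} for the stimulus.
    intervals: {cut number c: interval between cuts c and c + 1} from step 4.
    """
    lowest, highest = min(x_by_cut), max(x_by_cut)
    if lowest == highest:
        return None  # the stimulus is cut by only one dichotomy
    # Difference between the extreme unequated X-values of the stimulus.
    observed = x_by_cut[lowest] - x_by_cut[highest]
    # Sum of the step-4 intervals separating those two dichotomies.
    expected = sum(intervals[c] for c in range(lowest, highest))
    return expected / observed


# Hypothetical values, invented for illustration only.
intervals = {1: 1.0, 2: 1.0}
x_wide = {1: 0.5, 2: 0.0, 3: -0.5}

factor = dispersion_correction(x_wide, intervals)
corrected = {cut: value * factor for cut, value in x_wide.items()}
print(factor, corrected)
```

The factor of 2 recovered here equals the ratio of dispersions assumed in constructing the example; a stimulus divided by a single dichotomy returns None, which corresponds to the second disadvantage noted in the text.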


FRED ATTNEAVE 


The procedure just described extends 
the application of the Graded Dichoto- 
mies Method to situations covered by 
Thurstone’s Cases I-IV. Two disad- 
vantages of the procedure must be 
pointed out, however: (1) It assumes 
that the mean dispersion of stimuli rep- 
resented in one dichotomy is equal to 
the mean dispersion of those represented 
in another; this assumption may be 
only approximately correct. (2) Situa- 
tions may arise in which certain (usu- 
ally extreme) stimuli are cut by only 
one dichotomy, in which case the data 
are insufficient for the estimation and 
correction of their dispersions. The lat- 
ter difficulty is minimized by the use 
of a scale with a relatively large num- 
ber of categories. 


SUMMARY 


The treatment of rating-scale data 
by a Method of Graded Dichotomies is 
described. This method avoids the as- 
sumption of equal scale intervals, and 
yields results closely comparable to 
those obtained by Thurstone’s paired 
comparison techniques. 
AN APPLICATION OF sER QUANTIFICATION PROCEDURE 


BY DAVID ZEAMAN 


University of Connecticut 1 


INTRODUCTION 


A methodology has been proposed by 
Hull, Felsinger, Gladstone and Yama- 
guchi (6) for the quantification of the 
construct, reaction potential (sER). In 
their paper (p. 252), they state: 


“As has been pointed out, there are 
many different indications of reaction po- 
tential. It is probable that some forms of 
learning can be found which yield two or 
more such objective indications of the very 
same habit strength acquisition, each one 
of which could be used independently as 
reaction latency was above. It would seem 
that such curves of learning even though 
secured from quite different data should 
agree as to the type of best fitting equa- 
tion and also, within the limits of the data 
sampled, as to the various constants. Such 
a determination may very well prove to be 
a critical test of the methodology.” 


It is the purpose of the present paper 
to apply the sER quantification procedure
to two different measures of performance
in the same learning situation. 


THE DATA 


Empirical learning curves for the two 
objective measures are presented in 
Fig. 1. The two curves show the course 
of change of median latencies and 
median running-times of 40 rats dur- 
ing 18 conditioned reactions on a sim- 
ple elevated straight runway (Graham- 
Gagné type). Latency is the time the 
animal takes to leave the starting box 
once the door is opened; running-time, 
the period required for traversal of the 
three-foot runway to the goal box.² 


1 This paper was written while the author 
was a staff member of the Department of 
Psychology, Brown University. 

The more important conditions of 
reinforcement include: (1) temporal 
factors—only one reinforcement trial a 
day was given with no delay of rein- 
forcement; (2) kind of reinforcement— 
“process” cheese; (3) amount of rein- 
forcement—an intermediate amount in 
the appetite range, 0.60 gram on each 
trial. The radical spacing of trials is 
necessary to eliminate inhibition fac- 
tors which would complicate the quan- 
tification of what is assumed to be a 
single factor of excitation. 

As measures of excitation or reaction 
potential, both latency and running- 
time have been shown to be sensitive 
to changes in the common experimental 
variables and operations; that is, they 
have been used to demonstrate many 
of the basic behavioral processes (1, 2, 
4, 7, 8, 12, 14). In Hull’s system (5) 
response latency appears formally de- 
fined as one of the four observable 
anchors of sER. Running-time, how- 
ever, is not given a formal status al- 
though it is used by Hull (5) a num- 
ber of times as an index of reaction 
potential. As has been pointed out 
(14), running-time could easily be in- 
cluded within the framework set down 
in the Principles of Behavior by extend- 
ing the category (A) amplitude of re- 
sponse to include striate muscle re- 
sponses as well, or by making it a spe- 
cial case of a latency measure. 


2 The details of the apparatus and procedure 
have been described in a previous study con- 
cerned with the effects on latency of varying 
amounts of reinforcement (14). The data of 
the present paper are also derived from the 
experimentation of that study, performed in 
the Psychological Laboratory of Columbia 
University. 


FIG. 1. Both median latencies and median runway-times are plotted as a function of 
the number of conditioned reactions. Different equations are shown fitted to the two sets of 
data. 


The empirical learning curves for the 
two measures are quite different, as can 
be seen in Fig. 1. The range of scores 
is much greater for latency than for 
running-time, and would be greater yet 
if the first trial (counted as uncondi- 
tioned) were included. The apparent 
asymptote of each curve is clearly dif- 
ferent, the latency measure yielding a 
greater final strength. The two curves 
are similar in that they are both fairly 
regular, negatively accelerated, decreas- 
ing functions, undergoing the major 
fraction of their total change in a few 
trials, and approaching an asymptote 
within the 19 trials given. Before the 
quantification procedure is ap- 
plied to these two different measures to 
discover if, indeed, a common function 
results, it is desirable to express with 
more precision these similarities or dif- 
ferences between the two unquantified, 
empirical functions. To this end, curve- 
fitting procedures were introduced to 
give mathematical expression to the 
two learning curves. 


CURVE-FITTING 


There are of course an infinite num- 
ber of equations which can be fit to any 
set of empirical data. Ordinarily in the 
absence of a theoretical prediction of 
equation form, heavy use is made of the 
Principle of Parsimony in trying first, 
and accepting the best fit among, the 
simpler equation types. 

In selecting equations to be fitted to 
the two empirical learning curves in 
Fig. 1 it is possible to be guided not 
only by the general shape of the curves 
themselves, but by some rational con- 
siderations derived from the nature of 
the phenomenon under investigation, 
and by previous attempts at such curve 
fitting to related data. The rational 
considerations include at least these: 
(1) the equation form must yield a 
finite expression for the starting point 
or first conditioned reaction; (2) it 
must yield an expression for a finite 
asymptote; (3) it must not generate 
any negative values. 

Among the simpler equation forms 
which generate appropriately-shaped 
curves, and meet the criteria, these 
were tried: 


(1) a power function, $Y = aN^{-b} + c$, 

(2) an exponential function, $Y = ae^{-bN} + c$, 

(3) a log power function, $\log Y = aN^{-b} + c$, 

(4) a log-exponential function, $\log Y = ae^{-bN} + c$, 


where Y is the measured performance 
and N the number of conditioned re- 
actions. Felsinger, Gladstone, Yama- 
guchi, and Hull (2) have found that 
a power function closely describes their 
latency learning curve. Graham and 
Gagné (4) used an exponential type of 
equation with success in predicting the 
course of acquisition with latency meas- 
ures. For statistical purposes, logarith- 
mic transformations of time measures 
are frequently employed, so the good- 
ness of fit of the power and exponential 
function should be tested with latency 
and running-time expressed in logarith- 
mic form. 

A graphical technique tested each of 
these equation forms for appropriate- 
ness and goodness of fit to the empirical 
data. For the power function this test 
consists of plotting on log-log graph 
paper the difference between the final 
limit, or asymptote of performance, and 
the actual performance (median latency 
of the group) on each trial against 
the number of previous reinforcements.³ 
The rectilinearity of such a plot con- 
firms the choice of equation form and 
measures the goodness of fit. A diffi- 
culty arises, however. Since the asymp- 
tote of performance was neither known 
nor easily estimable from a plot of the 
raw data, it was necessary to assume 
various values and test for linearity of 
the log-log plot. An assumed asymp- 
tote of .55 second yielded a curve most 
closely approximating a straight line. 
The slope of this line was — 1.40, the 
Y-intercept 10.4. These two values are 
the remaining constants in the power 
function which was sought. The equa- 
tion then reads: 


latency $= 10.4\,N^{-1.40} + .55$. 


Figure 1 shows this equation to repre- 
sent adequately the latency data. 
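
The graphical test for the power function can be sketched numerically. The following is a minimal illustration and not the author's computation: it replaces inspection of log-log paper with an ordinary least-squares line (an assumption, since the text does not say how the line was drawn), and it uses synthetic medians generated from the fitted equation rather than the original data. The function names and the grid of trial asymptotes are hypothetical.

```python
import math

def fit_line(xs, ys):
    """Least-squares line y = slope * x + intercept, plus the mean squared residual."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    mse = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys)) / n
    return slope, intercept, mse

def power_function_test(trials, scores, asymptotes):
    """Try each assumed asymptote; return (asymptote, slope, coefficient, mse)
    for the log-log plot that is closest to a straight line."""
    best = None
    for c in asymptotes:
        if min(scores) <= c:
            continue  # log(Y - c) is undefined unless every score exceeds c
        log_n = [math.log10(n) for n in trials]
        log_diff = [math.log10(y - c) for y in scores]
        slope, intercept, mse = fit_line(log_n, log_diff)
        if best is None or mse < best[3]:
            # The intercept is read at log N = 0, i.e. N = 1, so 10**intercept
            # is the coefficient of the power function.
            best = (c, slope, 10 ** intercept, mse)
    return best

# Synthetic medians from latency = 10.4 N^-1.40 + .55 (NOT the original data).
trials = list(range(1, 19))
latencies = [10.4 * n ** -1.40 + 0.55 for n in trials]
candidates = [i / 20 for i in range(0, 15)]  # assumed asymptotes 0.00 to 0.70 sec.
c, slope, a, mse = power_function_test(trials, latencies, candidates)
print(f"asymptote={c:.2f}  slope={slope:.2f}  coefficient={a:.1f}")
```

With real medians no assumed asymptote gives a perfectly straight plot, so the residual term serves only to rank the candidates, much as the eye would on graph paper.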


3 Accepting Hull’s convention, the first trial 
is regarded as a measure of unconditioned 
strength, and numbering of performance 
measures starts with the first conditioned re- 
action—on the second day’s trial. 


The running-time data were first 
tested with the power function since 
that has already proven applicable to 
the latency data. The most nearly 
linear of the log-log plots (as before) 
resulted when an asymptote of zero 
was assumed. Therefore this equation 
form was ruled out on the rational 
grounds of impossibility of a zero as- 
ymptote. 

Next, the running-time data were put 
to the graphical test for the exponential 
equation form. The test requires a 
semi-log plot; the logarithm of the dif- 
ference between actual running-time 
and an assumed asymptote plotted 
against number of previous reinforce- 
ments. The results were most satis- 
factory with an asymptote assumed 
at 1.4 seconds. Computation by the 
method of averages gave the slope and 
Y-intercept of the straight-line func- 
tion, which in turn are the remaining 
two constants in the final equation: 


running-time $= 3.64 \times 10^{-.11N} + 1.40$. 
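
The semi-log test and the method of averages can be sketched as follows. This is an illustration under stated assumptions, not a reproduction of the original computation: the points are grouped into a first and a second half (the text does not say how they were grouped), the data are synthetic, and the constants 3.64, .11 and 1.40 are illustrative values of the order reported here. All function names are hypothetical.

```python
import math

def method_of_averages(xs, ys):
    """Line through the centroids of the first and second halves of the points."""
    half = len(xs) // 2
    x1 = sum(xs[:half]) / half
    y1 = sum(ys[:half]) / half
    x2 = sum(xs[half:]) / (len(xs) - half)
    y2 = sum(ys[half:]) / (len(ys) - half)
    slope = (y2 - y1) / (x2 - x1)
    intercept = y1 - slope * x1
    return slope, intercept

def exponential_constants(trials, scores, asymptote):
    """Semi-log test: fit log10(Y - c) against N and return (a, k)
    for the equation Y = a * 10**(-k * N) + c."""
    log_diff = [math.log10(y - asymptote) for y in scores]
    slope, intercept = method_of_averages(trials, log_diff)
    return 10 ** intercept, -slope

# Synthetic running-times (NOT the original data); constants are assumed.
trials = list(range(1, 19))
running_times = [3.64 * 10 ** (-0.11 * n) + 1.40 for n in trials]
a, k = exponential_constants(trials, running_times, asymptote=1.40)
print(f"coefficient={a:.2f}  rate={k:.2f}")
```

The method of averages needs only sums and one division per constant, which may explain its appeal for hand computation; a least-squares line would serve equally well here.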


This same test for the exponential 
type of equation was applied to the 
latency values, to the logarithm of the 
latencies, and to the logarithm of the 
running-times. The log scores of each 
measure were also tested with the power 
function. The results of these five tests 
were negative. Either there were ob- 
vious non-linear trends in the log-log 
or semi-log plots when a reasonable 
range of asymptotes had been sampled, 
or the random deviations (squared) 
from the best fitting straight line aver- 
aged out larger than those found in the 
tests yielding the two equations written 
above. 


sER QUANTIFICATION 


This section prepares the answer to 
the question: What does sER quantifi- 
cation do to the two different empirical 
learning curves, for latency and run- 
ning-time? The details of the meth- 
odology have been set forth with ad- 
mirable simplicity by H.F.G. & Y.⁴ as 
adapted from Thurstone (9, 10, 11). 
It is only necessary therefore to pre- 
sent evidence of the soundness of the 
assumptions on which the methodology 
is based before giving the results. 

The first assumption concerns the 
homogeneity of the variability in σ_k, 
an estimate of the dispersion of the 
individual variables, said to be related 
to Hull’s “oscillation” of reaction po- 
tential. The σ_k’s are used to rectify 
certain scale distortions in the quantifi- 
cation. The differences between any 
pair are assumed to be not very large. 
A two-to-one ratio of largest σ_k to 
smallest σ_k is considered sufficient 
homogeneity. Figure 2 shows the trial- 
by-trial changes in σ_k. It can be seen 
that as learning proceeds there is much 
variability in the estimated individual 
dispersions: random variation due to 
limited size of the sample, and sys- 
tematic variation due to the progress 
in learning. These curves resemble the 
σ_k function presented by H.F.G. & Y. 
in that they have a wave-like appear- 
ance. They differ in that the waves 
are greater in amplitude and perhaps 
shorter in length. The increase in am- 
plitude of variation would be expected 
since the N of this study was only two- 
thirds that of the H.F.G. & Y. study. 

There are two points of major im- 
portance to be made about Fig. 2. The 
first involves a comparison between 
the order of magnitude of the σ_k varia- 
tion in this and in the H.F.G. & Y. 
study; the second considers the rela- 
tion between σ_k variability of the laten- 
cies and of the running times. Inspec- 
tion of Fig. 2 shows that most of the 
σ_k’s of the present study fall within the 
range of those in the H.F.G. & Y. study. 


4 For convenience, continued reference will 
be made to the series of papers by Hull, Fel- 
singer, Gladstone and Yamaguchi (2, 3, 6, 
13) by the abbreviation H.F.G. & Y. 
FIG. 2. The dispersion of the individual variables (σ_k), used in the present quantification 
procedure, as related to the progress of learning. The two horizontal dashed lines indicate 
the range within which fell all the σ_k values used in the H.F.G. & Y. study (6). 


If three of the 36 σ_k values were ex- 
cluded, the remaining σ_k’s would ap- 
proximate the 2-1 ratio criterion. This 
degree of heterogeneity throws some 
doubt upon the assumption of homo- 
geneity, but it is not considered great 
enough to completely vitiate the ulti- 
mate comparison of the sER functions 
generated by the quantification pro- 
cedure. The close parallelism of the σ_k 
curves for the two measures fortifies 
this judgment since it is likely that 
whatever distortion is produced by the 
heterogeneity would affect both sER 
functions roughly to the same degree 
leaving the relation between them un- 
disturbed. The primary purpose of the 
paper is of course to find whether the 
resultant sER curves are the same or 
different. 
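
The two-to-one criterion lends itself to a simple check. The sketch below is illustrative only: the dispersion values are invented, and the function names are hypothetical. It reports the ratio of largest to smallest dispersion and the smallest number of extreme values that would have to be set aside for the remainder to satisfy the criterion.

```python
def dispersion_ratio(sigmas):
    """Ratio of the largest to the smallest dispersion estimate."""
    return max(sigmas) / min(sigmas)

def exclusions_needed(sigmas, limit=2.0):
    """Smallest number of values to drop so that max/min of the rest is <= limit.

    The retained values always form a contiguous run of the sorted list,
    so a sliding window over the sorted values finds the longest such run."""
    ordered = sorted(sigmas)
    longest = 0
    low = 0
    for high in range(len(ordered)):
        while ordered[high] / ordered[low] > limit:
            low += 1
        longest = max(longest, high - low + 1)
    return len(ordered) - longest

# Invented dispersion estimates, for illustration only.
example = [1.0, 1.2, 1.5, 1.9, 2.6, 0.4]
print(dispersion_ratio(example), exclusions_needed(example))
```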

A second assumption upon which the 
quantification procedure is based is 
tested by plotting against each other 
any two columns of the tables num- 
bered 5 and 7 by H.F.G. & Y. Ten 
pairs of columns from Table 5 for each 
of the two measures were plotted, and 
ten pairs of columns for each measure 
from Table 7. The outcome of these 
tests was substantially the same as that 
found by H.F.G. & Y. (6). The plots 
from Table 5 were approximately linear, 
as were those from Table 7, but the 
slopes of the Table 7 plots came closest 
to the desired unity. The slope of each 
graph was computed. The average 
slope of the 10 plots for the running- 
time data from Table 7 was .98 with 
an average deviation of .09. The av- 
erage slope for the corresponding la- 
tency values was .97, with an average 
deviation of .09. 

These findings confirm the appropri- 
ateness of Thurstone’s Case V method 
for these data by lending validity to the 
dispersional assumptions underlying it. 

The quantification is based upon the 
assumption of normality of distribution 
of sER’s and sER differences. The 
soundness of this assumption is given 
support by the evidence presented in 
the last paragraph. A more precise 
method of testing the normality of 
sER’s has been employed by H.F.G. & 
Y. The test consists of: 


1. completing the quantification using 
the normal probability tables and de- 
riving the equation which expresses the 
relation between sER and stR; 

2. converting a number of asymp- 
totic latencies of each animal into sER’s 
by use of this equation; 

3. constructing a frequency distribu- 
tion of these sER’s corrected for indi- 
vidual asymptote differences; 

4. making another distribution of the 
sER differences at the latency asymp- 
tote; 

5. computing the deviation from nor- 
mality of these distributions in terms of 
skewness and kurtosis. 
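
The final step of this test can be sketched as follows. The particular indices of skewness and kurtosis used by H.F.G. & Y. are not given here, so the moment coefficients below are an assumption, and the function names are hypothetical. A positive excess kurtosis indicates a leptokurtic distribution.

```python
def central_moment(values, order):
    """Central moment of the given order about the sample mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** order for v in values) / len(values)

def skewness(values):
    """Moment coefficient of skewness; zero for a symmetrical distribution."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """Moment coefficient of kurtosis minus 3; zero for the normal curve,
    positive for a leptokurtic distribution."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3.0

# Invented scores, for illustration only: a peaked, heavy-tailed sample.
sample = [-3, 0, 0, 0, 0, 0, 0, 3]
print(skewness(sample), excess_kurtosis(sample))
```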


H.F.G. & Y. applied this test to their 
data and found the distributions of 
sER’s to deviate from normality in the 
direction of leptokurtosis. They cor- 
rected for this distortion by construct- 
ing an empirical leptokurtic probability 
table and running through the quantifi- 
cation procedure again. 

This test and refinement were not 
employed in the present study for sev- 
eral reasons. The prohibitive amount 
of work involved in the computations 
necessary for the re-quantification may 
be, perhaps, a real reason. Good rea- 
sons include the fact that only about 
five asymptotic latencies were available 
for each animal as compared with the 
20 such latencies in the H.F.G. & Y. 
study. This would severely limit the 
size of the distribution of sER’s and 
thus limit the accuracy of the correc- 
tion to be made on the basis of the 
shape of the distribution. The best 
reason lies in the fact that H.F.G. & Y. 
found the correction for leptokurtosis 
to make so little difference in the final 
sER equation. Only one constant, the 
coefficient, was changed and that by a 
factor of only 10.6 per cent. 

There seem to be sufficient grounds 
for the assumption that whatever dis- 
tortion is caused in the two sER curves 
of this study by the use of normal 
probability tables is not great and/or 
does not affect the two sER curves dif- 
ferentially. 


RESULTS 


The two curves presented in Fig. 3 
comprise the major findings. They are 
two sER learning curves derived from 
two different objective measures of per- 
formance in the same learning situa- 
tion and from the same animals. That 
the two curves are different is obvious 
by simple inspection. No statistics are 
needed to establish the true difference 
between the curves other than this 
statement: If there is no difference be- 
tween the curves, the chances of the 
entire 17 points of the running-time 
curve falling as they do above the cor- 
responding points of the latency curve 
are equal to the chances of tossing a 
coin 17 heads in succession. 


FIG. 3. Two theoretical learning curves derived from the application of the sER quantifica- 
tion procedure to two empirical measures of performance in the same learning situation. The 
two functions are significantly different. 


More precisely how the two curves 
differ may be determined by the equa- 
tions fitted to them. The problem of 
equation fitting here was somewhat 
simplified. Since the significance of 
this study derives from its inclusion 
within a systematic framework, the 
choice of an equation type may be such 
as to facilitate comparison with other 
parts of the system. Hull (5) has used 
exclusively the exponential type func- 
tion to represent habit growth and has 
related tentatively the constants of this 
type of equation to certain of the ex- 
perimental variables. For these rea- 
sons the best-fitting exponential type 
function was found for each set of data. 


The two equations read: 


$_{S}E_{R}$ (latency) $= 3.6\,(1 - 10^{-.09N})$, 
$_{S}E_{R}$ (running-time) $= 4.7\,(1 - 10^{-\ldots N})$. 


No greater precision in the approxima- 
tion of equation constants than is indi- 
cated is required to give fair representa- 
tion to the data and at the same time 
mediate the quantitative comparison 
which is the goal of the paper. 

To be compared with the above two 
sER equations is the equation found by 
H.F.G. & Y. (3): 


$_{S}E_{R}$ (latency) $= 3.084\,(1 - 10^{-\ldots N}) + .04$. 


Their equation was found to hold over 
the course of 60 learning trials. If this 
equation were graphed for the first 17 
trials and plotted in Fig. 3, the resultant 
curve would fall far below the running- 
time and latency curves already there. 

The relationships among these three 
functions require analysis and further 
discussion. 


DISCUSSION 


1. The large difference between the 
sER (latency) function from the H.F.G. 
& Y. study and the present one would 
be predicted on the basis of difference 
in amounts of reinforcement.⁵ The 
amount in the present study was .60 
gram; that of the H.F.G. & Y. study 
.08 gm. The asymptotes of the two 
functions are only slightly different 
(respectively, 3.6 against 3.1); a larger 
relative difference appears in the rate- 
of-approach constant (.09 against .03). 
The latter difference expresses itself 
graphically in the fact that the latency 
curve in Fig. 3 closely approaches its 
asymptote within 17 trials, whereas the 
H.F.G. & Y. latency curve requires about 
three times this number of trials to ap- 
proach its final limit. If this analysis 
is true, it leads to the conclusion that 
the parameter of amount of reinforce- 
ment affects the rate-of-approach con- 
stant part of the habit strength equa- 
tion rather than the asymptote part as 
suggested by Hull (5). This is to be 
contrasted with a previous finding (14) 
that when exponential equations are 
fitted to a family of latency learning 
curves generated by the amount pa- 
rameter, it is the asymptote that varies 
and not the rate-of-approach constant. 
It is possible that both of these find- 
ings are true, since there is no simple 
one-to-one relationship between an sER 
learning curve and an empirical learn- 
ing curve both derived from the same 
data. Comparison of Figs. 1 and 3 
shows this very well. 
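
The relation between the rate-of-approach constant and the number of trials needed to near the asymptote can be made explicit. The sketch below assumes the growth form sER = A(1 − 10^(−kN)), ignores the small additive constant in the H.F.G. & Y. equation, and takes the rounded rate constants .09 and .03 cited above; the 95 per cent criterion and the function names are arbitrary choices made for illustration.

```python
import math

def growth(n, asymptote, rate):
    """Exponential growth function A * (1 - 10**(-k * N))."""
    return asymptote * (1 - 10 ** (-rate * n))

def trials_to_fraction(rate, fraction):
    """Number of trials N at which the curve reaches the given fraction
    of its asymptote: solve 1 - 10**(-k * N) = fraction for N."""
    return -math.log10(1 - fraction) / rate

present = trials_to_fraction(0.09, 0.95)   # rate constant of the present study
earlier = trials_to_fraction(0.03, 0.95)   # rounded H.F.G. & Y. rate constant
print(f"present: {present:.1f} trials, H.F.G. & Y.: {earlier:.1f} trials")
```

Because the required number of trials varies inversely with the rate constant, the ratio of the two figures is three whatever criterion fraction is chosen, which agrees with the statement that the earlier curve needs about three times as many trials.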

2. If latency and running time are 
accepted as objective indices of the very 
same habit strength acquisition, then the 
critical test of the sER quantification 
methodology as suggested by H.F.G. & 
Y. (6) fails. This follows from the 
fact that the sER curve for latency 
differs from the sER curve for running- 
time. Both equation constants for the 
two curves differ but the heaviest dif- 
ference lies in the constant controlling 
the final limit (3.6 for the latency asymp- 
tote against 4.7 for the running-time asymptote). 


5 Not only the amount of food but the 
kind of food was different. Observations 
made by the author on a number of animals 
revealed that rats receiving cheese as reward 
in an experimental situation and dog-chow 
pellets as home-cage food will choose each 
type of food with about equal frequency when 
both are available in a neutral situation. This 
suggests that the difference between the two 
curves is attributable to the large difference 
in amount of reinforcement. 

3. The test of the methodology need 
not fail if the measures themselves are 
questionable. Three aspects of the 
measures may be questioned: (1) their 
objectivity, (2) their acceptability as 
measures of reaction potential, (3) their 
acceptability as measures of the same 
habit strength. The measures are ob- 
viously objective. They are both ac- 
ceptable as measures of reaction po- 
tential on the grounds of convention 
(they both have been widely used); on 
the grounds of evidence (a wide range 
of behavioral processes has been dem- 
onstrated by their use); and on a priori 
grounds of generality (the more data 
that can be incorporated into a system, 
the better). But the third aspect may 
be seriously questioned. In fact, the 
runway habit has been analyzed more 
than once (1, 12) into a chain of at 
least two components, or sub-habits. 
An alternative explanation presents 
itself, then, in place of that of failure 
of the methodology in question. La- 
tency and running-time may be re- 
garded as two different indices of two 
different habits within a chain of habits 
leading to a single terminal reinforce- 
ment. The first habit in the chain, 
coming out of the box, is measured by 
latency; the second member of the 
chain, traversal of the runway, is meas- 
ured by running time. 

4. The charge must be faced that 
this type of reasoning is circular: If the 
two measures give the same sER func- 
tion, they are two measures of a single 
habit; if not, they are two different 
habits. In this way the quantification 
methodology would always be saved. 
The ad hoc nature of this reasoning 
is more apparent than real, because 
there are differential consequences of 
there being one or two habits involved. 
If the sER quantification of two meas- 
ures yields different curves, and it is 
then asserted that there must be two 
different habits, then there should be 
observable differences between these 
two habits which can be tested. The 
two habits in the present case, for ex- 
ample, differ in their temporal and 
spatial distance from the goal. The 
consequences of one of two habits being 
farther away in time from a goal are 
known. The habit which has its rein- 
forcement delayed the longer will be 
the weaker. From this it can be simply 
deduced that the latency sER learning 
curve, representing the growth of the 
coming-out-of-the-box habit, will be ex- 
ceeded by the running-time sER curve, 
since the latter represents a habit 
stronger by virtue of its closeness to 
the goal. This deduction is true. Fig- 
ure 3 shows the greater strength of the 
runway habit at every stage of learning. 
This is shown by the sER curves only, 
not the empirical learning curves of 
Fig. 1. 

There will usually be many other dif- 
ferential consequences of the assump- 
tion of two habits. One may have 
greater inhibitional components and be 
expected to extinguish faster; or the 
two habits may have different gradients 
of stimulus and response generaliza- 
tion; or the serial position of the habits 
may make a difference. It would re- 
quire further experimentation with the 
runway to determine both the presence 
of these factors and their consequences. 
However, the difference of delay of re- 
inforcement is a necessary consequence 
of the physical construction of the ap- 
paratus, and the general effect is al- 
ready well established by the many 
delay-of-reinforcement studies (5, p. 
163). Hull’s theory accounts for more 
than the direction of superiority of the 
short-delay member of the chain. A 
rough quantitative prediction is per- 
mitted by the theoretical assumption of 
a direct relationship between the delay 
of reward and the asymptote of learn- 
ing. Inspection of the two sER curves 
in Fig. 3 shows that it is predominantly 
the asymptote constant which distin- 
guishes the two functions. These find- 
ings add support to the analysis of the 
runway sequence into two functionally 
discrete habits. They also bear upon 
the goodness of the reaction potential 
quantification procedure as a valid 
method of providing comparable meas- 
ures of the two habits. Following 
quantification there appeared theoreti- 
cally expected differences between the 
habits which could not be tested with 
the non-comparable, empirical measures. 


SUMMARY AND CONCLUSIONS 


1. The reaction potential (sER) quan- 
tification procedure set forth by Hull, 
Felsinger, Gladstone, and Yamaguchi 
(6) has been applied to two measures of 
performance of the same animals in the 
same learning situation to provide a 
test of the explicit assumption that such 
a procedure would yield a single func- 
tion from the two empirical learning 
curves despite any difference in ob- 
served form and magnitude. 

2. The empirical learning curves of 
the two measures—latency and running 
time—obtained from the Graham-Gagné 
type of runway were shown to be dif- 
ferent. A mathematical statement of 
this difference was given by the equa- 
tions found to closely represent the two 
learning curves. A power function 
fitted the latency data while an ex- 
ponential equation described the course 
of learning as measured by running- 
time. 

3. A number of the statistical as- 
sumptions underlying the quantification 
procedure were tested. They were con- 
sidered sufficiently tenable to warrant 
comparison of the two sER learning 
curves generated by the procedure. 

4. The two sER learning curves were 
different. Exponential equations fitted 
to the two theoretical learning curves 
showed that they differed predominantly 
in respect to the constant controlling 
the asymptote of learning. 

The latency sER curve also differed 
from that obtained with a different ap- 
paratus by Hull, Felsinger, Gladstone 
and Yamaguchi in being a much faster 
rising curve. Hull’s theory can be 
easily adjusted to account for this dif- 
ference on the basis of a difference in 
amount of reinforcement. 

Such an explanation could not be 
made of the difference between the sER 
curves for latency and running-time. 

5. The methodology under test can- 
not be said to have failed unless a pos- 
sible alternative explanation of the re- 
sults is ruled out. 

6. The alternative explanation states 
that the methodology is valid, but that 
the two measures are not of the same 
habit but of two different habits in a 
chain leading to a single terminal re- 
inforcement. A number of possible 
tests of this alternative explanation are 
stated. They involve the testing of the 
differential consequences of there being 
two rather than one habit. One such 
test was possible with the data on 
hand, and the test confirmed the likeli- 
hood of the alternative explanation by 
predicting the observed difference be- 
tween the two sER curves on the basis 
of delay of reinforcement. 

7. While it is probably true, there- 
fore, that the data on hand do not pro- 
vide a critical test of the reaction po- 
tential quantification procedure, they 
do provide relevant confirming evidence 
of the utility of the procedure in bring- 
ing the empirical facts into closer re- 
lationship with a theory of behavior. 
THE APPLICABILITY OF MOTIVATIONAL CRITERIA 
TO EMOTIONS 


BY R. H. WATERS AND D. F. BLACKWOOD 
The College of Wooster 


In a recent paper Leeper (15) sug- 
gests that, contrary to outside opinion, 
present-day psychology needs not more 
facts and less theory, but theory tai- 
lored to conform more closely to the 
facts which have already been accumu- 
lated. As an example of the sort of 
theoretical revision which is needed to 
keep psychology abreast of its growing 
body of factual material, Leeper ex- 
amines the treatment given the emotions 
in some of our widely used texts and 
argues that their treatment of the emo- 
tions as disorganized phenomena is ob- 
solescent. He shows that the terms 
“disorganized” and “disorganizing” are 
frequently used in a contradictory man- 
ner and that a careful survey of the 
factual material at our disposal shows 
them to be poor generalizations. Ac- 
cording to Leeper the term “disorgan- 
ized” has been incorrectly applied to 
data which actually show reorganiza- 
tion, organization along new lines. Be- 
cause they arouse and organize behav- 
ior along given lines, Leeper holds that 
the emotions operate as motives and 
should properly be included within the 
concept of motivation. 

Two other papers have since appeared 
bearing directly on his proposal. Duffy 
(11) applauds his stand and discusses 
some of the theoretical issues involved, 
such as the difference between emo- 
tional and non-emotional behavior and 
the problem of the nature of emotional 
processes which would set them off from 
other motives. Webb (22) also adds 
other objections to the current treat- 
ment of emotion and suggests a pos- 
sible theoretical orientation for the new 
view. That neither of these, though 
contributing significantly to the prob- 
lem, attacks the question along the lines 
of the present paper will become clear 
as we proceed. 

The suggestion that emotions be in- 
cluded in the category of motivation de- 
mands an examination of the concept of 
motivation to determine whether such 
an inclusion can be justified. The pres- 
ent paper is an attempt at such an ex- 
amination. It will be restricted to the 
applicability of motivational criteria to 
emotional phenomena. 

It should be kept in mind from the 
outset that a concept is a symbol stand- 
ing in lieu of a group of specific refer- 
ents which share some common at- 
tribute. The concept “dog,” for in- 
stance, refers to a group of animals 
which have some things in common, but 
all of which are by no means identical. 
Thus in order to justify the inclusion 
of the emotions within the category of 
motivation, it is not necessary to dem- 
onstrate that the emotion of fear and 
the hunger drive are the same thing, 
but merely to show that fear is similar 
to hunger in the same way in which 
hunger is similar to sex need, that is to 
say, to show that they have comparable 
dimensions. 

Our first move must be to ferret out 
the criteria of motivation, to determine 
what common attributes hunger, thirst, 
sex need and the like have which cause 
us to think of them as belonging in the 
same category. These criteria may then 
be tried on the emotions and tested for 
goodness of fit. A study of the litera- 
ture reveals that there have been, in 
general, two sets of criteria commonly 
applied to motives. According to Young 
(24) these may be designated the be- 
havioral and the physiological. 

Motivation is a common concept in 
psychology today and certainly an im- 
portant one. Why, then, should there 
be a difference of opinion as to its na- 
ture? The answer to this question is 
that motivation occupies the position of 
an intervening variable (12). It is not 
itself directly observable but is inferred 
to exist between two units of observable 
behavior in order to explain a relation- 
ship which seems to exist between them. 
The difference in interpretation depends 
upon the size of the unit which is ob- 
served. For example, according to the 
behavioral criterion, hunger may be in- 
ferred when (a) behavior is aroused, 
and (b) it is organized in such a way 
as to lead to the ingestion of food. 
Many psychologists who insist on a 
finer analysis of the behavior sequence 
will not admit the adequacy of this cri- 
terion. They will point out that when 
the rat is observed to enter the food 
box we are not justified in assuming 
that he is operating under hunger mo- 
tivation unless we also know something 
about his physiological condition. Thus, 
according to the physiological criterion, 
it is necessary ¹ to demonstrate (a) an 
atypical tissue condition, followed by 
(b) gross behavior oriented toward 
some goal, followed by (c) a change in 
the tissue condition. 

Some writers admit that the physio-
logical criterion is adequate for what
are called the biological drives¹ but con-
tend that it is inadequate for the dem-
onstration of higher types of motiva-
tion. What, they ask, is the tissue
condition characteristic of a desire for
social approval? We have, then, a sort
of double concept of motivation em-
bracing simple biological drives which
have a clear physiological basis and the
so-called higher motives in which no
such condition is objectively demon-
strable.

If a physiological approach is to be
used in the area of higher motivation,
then the following alternatives present
themselves. Some may infer the pres-
ence of the physiological condition and
make no distinction between drive and
motive. They may argue that the
physiological basis of higher motives
would be demonstrated were our tech-
niques sufficiently refined. Others, pre-
ferring a broader and safer course, may
conceive the higher motives as growing
out of biological drives through learn-
ing. There is even further division
among the latter with some, like Dock-
eray (8),² considering the derivation to
be direct and the learning processes
involved rather simple. Others, like
Allport (1), hold that the genesis is
devious and the learning processes so
complex that the resultant motives be-
come functionally autonomous, com-
pletely distinct from and independent
of their roots in tissue need. The two
foregoing examples have been utilized
as illustrations of extremes along a con-
tinuum of learning imposed upon drive.
Not all positions have been so extreme,
nor have they been completely con-
sistent.³ Having determined the cri-
teria of motivation and indicated the
alternative positions with regard to the
relation of drive to motive, we may now
proceed to an examination of the con-
cept of emotion.

1 Those who hold strictly to the behavioral
criterion will not admit the adequacy of the
physiological criterion even in the area of
the biological drives. It is beyond the scope
of this paper to examine the validity of the
physiological criterion. It is sufficient for the
purpose of the present analysis to note that
it is a widely held position.

2 Dockeray’s position does not actually rep-
resent the theoretical extreme of the con-
tinuum, but approximates it about as well as
any.

3 Compare Dashiell’s treatment of motiva-
tion in his 1928 edition (5) with that in his
1937 edition (7).

Leeper’s protest is not the first state- 
ment of dissatisfaction with the concept 
of emotion as an independent category. 
Meyer (16), writing in 1933, drew an 
analogy between the concept of emo- 
tion and that of volition, and was so 
bold as to predict that the term would 
be cast out of our psychological vocabu- 
lary by 1950. (His time is growing 
short.) Shortly thereafter, in 1934, 
Duffy (9) suggested that “emotion” 
simply does not exist as a type of be- 
havior clearly discriminable from other 
types; and again, in 1941, seven years 
prior to the publication of Leeper’s 
paper, she (10) argued that in their ca- 
pacity to release energy in a given di- 
rection those phenomena which we call 
emotions are, indeed, a form of motiva- 
tion. 

In terms of the behavioral criterion 
it is not difficult to reconcile emotional 
phenomena with the concept of motiva- 
tion, nor is it inconsistent with some 
lines of traditional thought in psy- 
chology. Such an approach is gener- 
ally in harmony with the thinking of 
the functional psychologists, and lends 
itself rather well to the evolutionary 
emphasis for which they have been 
noted. From time to time a number 
of writers have considered the motiva- 
tional aspects of emotion according to 
the behavioral criterion. Such a con-
cept has been outlined by McDougall
(18) who considered that emotional
behavior represented innate response
patterns calculated to insure the sur- 
vival of the organism. A similar ap- 
proach is inherent in Cannon’s (3) early 
“emergency theory,” and in the several 
approaches based upon the biological 
concept of homeostasis. These last are 
somewhat closer to the physiological 
criterion, but lean heavily upon survival 
in the evolutionary sense. 

The behavioral criterion does not de-
pend upon the essential nature of the
phenomena, but rather describes the 
way in which they operate. For ex- 
ample, we observe that an animal di- 
rects its activity toward the ingestion 
of food and infer that it is motivated 
by hunger. Similarly, we observe that the
animal retreats from a dangerous situa-
tion and infer that it is motivated by
fear, that is to say, that the emotion of
fear has served to organize behavior in 
such a way as to remove or alter the ex- 
citing stimulus. With his emphasis on 
the fact that the behavior aroused by 
the emotions is organized and pur- 
posive, Leeper has aptly demonstrated 
that emotions may legitimately be 
thought of as motivation in the be- 
havioral sense, which defines motivation 
as that which serves to arouse, direct, 
and sustain activity in conjunction with 
some goal. 

Such broad approaches, it may be 
argued, lay themselves open to criticism 
because of their teleological leanings. 
They cannot be acceptable to those 
who insist upon a strict physiological
interpretation of motivation. For these 
psychologists motivation must be a mat- 
ter of here-and-now reduction of an im- 
mediate tissue need. Can it be shown 
that emotions meet the physiological 
criterion of motivation, that is to say, 
that they may be thought of as grow- 
ing out of physiological conditions and 
that these conditions are subsequently 
altered as a result of behavior? 

Although emotions as motivation do 
not appear to fit so readily the physio- 
logical criterion, there have, nonethe- 
less, been some suggestions made in that 
direction. This sort of inclination is to 
be noted in the writings of Woodworth, 
who says: 


“Anyone will unhesitatingly classify as 
emotions: anger, fear, disgust, joy and sor- 
row; and as states of the organism: hunger, 
thirst, nausea, fatigue, drowsiness, intoxi- 
cation. Now that physiology has revealed 
a peculiar organic state in fear and anger,
why do we continue to call them emotions 
and deny that name to fatigue or drowsi- 
ness?” (23, p. 234.) 


Woodworth has thus indicated that 
some emotions are comparable, physio- 
logically, to other states commonly 
called drives. Cannon (4), writing in 
1922, catalogued the tissue changes in- 
volved in pain, hunger, fear and rage. 
The inclusion of hunger among the con- 
ditions we call emotions appears par- 
ticularly significant, for is not hunger 
one of our more commonly referred to 
biological drives? 

In terms of Dockeray’s treatment of
motivation derived from drive, an ap- 
proach fairly common in our texts, the 
emotions can be fitted within such a 
framework. Dockeray views a drive 
as a tissue condition which stimulates 
the organism to random activity until, 
through trial and error, it hits upon 
some response which serves to alter the 
tissue need. This approach is wholly 
consistent with the usual genetic de- 
scription of the development of the 
hunger motive according to which the 
organism receives sensory impressions 
from the stomach, is stimulated to a 
wide variety of activity which brings it 
into contact with food, the exciting 
stimulus (tissue condition) is modified, 
and the organism, if it be a human one, 
learns to give the sensory impressions 
from the stomach the name “hunger.” ⁴
When, through learning conceived as 
being of a rather simple sort (condition- 
ing), the organism comes to associate 
the specific tissue need with a specific 
response which will serve to reduce it,
it is called a motive. Motive, then, is
equal to biological drive plus relatively
simple learning. There is some experi-
mental evidence to indicate that emo-
tions as motives can be similarly inter-
preted. Miller (17), working with rats,
used an induced fear as a presumably
motivating condition in a learning situ-
ation and found that it did indeed op-
erate as a motive in that it (a) pro-
duced trial and error activity, and (b)
served to reinforce the correct response.

4 It will be seen that this represents an
oversimplification of the mechanisms involved
in the hunger drive whose physiological com-
ponents are known to include much more
than stomach contractions alone. Stomach
contractions are here used as an illustration
of the sort of tissue condition which Dockeray
considers to furnish the drive.

Obviously, further research of a simi- 
lar nature is indicated. While the out- 
look, on the basis of this one experi- 
ment, is promising, it may be well to 
look to some of the theoretical implica- 
tions of such an inclusion. Consider- 
ing emotions as motivation within this 
framework may afford us some insight 
into the existential nature of the emo- 
tions themselves. Recall that for Dock- 
eray the drive which directly produces 
the motive is a tissue condition. The 
motive then becomes, in a sense, the 
organism’s interpretation of a specific 
tissue need. Fitting emotions to such 
a framework is tantamount to an ac- 
ceptance of the James-Lange theory of 
emotion (13, 14), which holds that an 
emotion is the perception of certain 
bodily changes, or, in more behavioristic 
terminology, an emotion is the name 
which we learn to give to a particular 
pattern of sensory impressions. It need 
hardly be pointed out that this theory 
is not currently in good repute since it 
implies that patterns of internal stimu- 
lation in the several emotions must be 
discriminable one from another and 
since no such discriminable differences 
have been satisfactorily isolated. If 
there are no discriminable differences in 
patterns of stimulation afforded by the 
physiological components of the vari- 
ous emotions, then we have a situation 
somewhat analogous to that found in 
the field of higher motivation. Clearly,
if Leeper’s idea of emotion as motiva-
tion is to be adopted in conjunction
with a concept of motives directly de- 
rived from tissue need, then further re- 
search is needed in the physiology of 
the emotions, calculated to demonstrate 
discrete patterns. This is an area which 
has already received considerable atten- 
tion and, as has been noted, the results, 
generally, have been negative. 
Fortunately there is another alterna- 
tive which, while it compromises to 
some extent the strict physiological 
viewpoint, effectively circumvents the 
difficulties inherent in Dockeray’s ap- 
proach. This alternative invokes All- 
port’s idea of functional autonomy. 
Here motives may also derive from 
biological drives with a basis in tissue 
need, but the learning processes in- 
volved are so complex that the result- 
ant motivation is completely independ- 
ent of its physiological roots. The em- 
phasis in functional autonomy is upon 
the divorcement of the motive from the 
tissue need involved in its genesis. 
Thus, when a motive has, through com- 
plex learning, become autonomous, it is 
to all practical intents and purposes 
psychogenic and one is no longer
obliged, when dealing with higher order 
motives and those areas of emotion 
which are hazily defined, to demonstrate 
the present operation of an atypical 
tissue condition. Hence, as Leeper has 
held, the theory of emotion which says 
that emotions are, indeed, psychogenic, 
and which Munn (19) considers too 
naive for serious treatment, may in the 
final analysis be the least naive of all 
the theories which have been advanced. 
It may be argued that in invoking 
functional autonomy to circumvent the 
need for the establishment of specific 
tissue conditions in all cases, we have 
left the strict physiological criterion 
and have arrived by indirection at a 
behavioral approach. The only answer 
to this must be that any theoretical area 
in psychology must be broad enough to 
cover all of the involved phenomena.
As Hull (12) has said, when theory 
and fact conflict, theory must always 
give way. 

From the foregoing review of the 
status of emotion and motivation, it ap- 
pears that the concept of emotions as 
motivation can be conditionally recon- 
ciled with a physiological approach to 
motivation, the conditions being these: 
In order to insure consistency those 
who hold to the physiological criterion 
of motivation will be faced with the 
alternative of either (a) instituting 
more intensive research with an eye to 
the establishment of definite patterns of 
atypical tissue conditions both in the 
so-called emotions and in the area of 
higher motivation, or (b) expanding 
their theoretical framework to include 
the concept of functional autonomy. 
Acceptance of the Leeper hypothesis 
will serve to combine two vague areas 
of investigation into one and offer posi- 
tive suggestions for research in this one 
area which may lead not only to clarifi- 
cation of the phenomena in question, 
but also to a narrowing of the gap be- 
tween those who approach the problem 
from different directions. 

The lines of research indicated by 
such a concept embrace not only in- 
vestigation of the physiological com- 
ponents of motives and emotions but 
also the operational aspects of both. 
We speak, for instance, of levels of 
motivation which are presumed to vary 
directly with goal deprivation. This 
raises the question of the possible quan- 
tification of emotions along similar lines. 
Again it has often been noted that ex- 
treme emotion appears to operate in 
contradiction to a motivational inter- 
pretation. At the same time there has 
been some suggestion that a similar 
situation exists in motivation (20). 
Too high a level of motivation may in- 
hibit goal-directed activity. 

Further consequences of the adop-
tion of Leeper’s hypothesis will prob-
ably involve such problems as the
reconciliation of the genetic develop- 
ment of emotions according to Bridges 
(2) with some scheme of the genetic 
development of motives, a treatment of 
Dashiell’s (6) suggestion that emotions 
may be nothing more than socially de- 
termined interpretations of environmen- 
tal conditions, and investigation in the 
area of the acquisition of new motives 
and emotions. 

Of course, the final word on the in- 
clusion of emotions within the category 
of motives must rest with the experi- 
mental findings in the above indicated 
areas. In terms of the narrower topic
assumed by this paper, however, it ap-
pears that the hypothesis may be tenta-
tively accepted.
THE NATURE AND FUNCTIONING OF WANTS ¹


BY EDWARD C. TOLMAN 
University of California 


When we examine the lists of mo- 
tives, or wants, cited either by com- 
mon sense or by the system-makers in 
psychology, we find items which seem 
to fall into either one or the other of 
two extreme categories. On the one 
hand, we find lists of the so-called 
“basic drives” or “needs,” such as: 
hunger, thirst, sex, aggression, need for 
affiliation, need for dominance, and the 
rest. Or, on the other hand, we find
statements about very specific and im-
mediate wants such as: the want for a
piece of candy when a box or dish ap-
pears, the want for a cigarette upon see-
ing a package handy, the want to get
out of the range of a given too hot fire,
etc., etc., etc.

Furthermore, corresponding to the 
emphasis on the one or the other of 
these two kinds of item, we find two 
theoretical points of view. There is the 
type of theory which stresses the im- 
portance of the basic drives or needs as 
the only fundamental energy- or ten-
sion-providing sources in the organism.
This type of theory seeks to derive all 
the immediate, more situationally de- 
scribed, wants from these basic drives 
or needs. The theorists whose names 
suggest themselves in this connection 
are, of course, Freud (4), McDougall 
(16), Murray (19), Sanford (23), Mas- 
low (18), Goldstein (5), and also Hull 
(9). And secondly, there is the kind 
of theory (the names which suggest 
themselves are Koffka [11], Lewin 
[14], Krech [13], Crutchfield [13], 
MacLeod [15]) which tends to take the
immediate situational want as its para-
digm and conceives the functioning of
the basic drives and needs as merely
highly charged instances of immediate
situational wants. This latter type of
theory stresses the immediate environ-
mental context. And, whether the want
in question be that of getting to food
or to a sex-object or merely that to
hang a crooked picture straight or to
pick a pin up off the floor, the impor-
tant determiners as seen by this second
sort of theory, which I shall call the
“neo-Gestaltist” theory, lie in the “dy-
namics” of the immediate “phenomeno-
logical” field. The basic laws of mo-
tivation are believed by this second
group to be analogous to the Gestalt
laws of perception (such laws, say, as
“Closure,” “Prägnanz,” “Better Figure”
or the like).

1 Presented to the graduate students of the
Universities of Iowa, Chicago, and Michigan
and of Western Reserve University.

The point of view to be defended in 
this paper is a compromise one. I 
would hold with the neo-Gestaltists that 
the functioning of a want, as an im- 
mediate event, is always an activity 
going on in what may be called an im- 
mediate “phenomenological” or, as I 
would prefer to conceive it, “behavior” 
field—in, that is, the present situation 
as immediately registered by the or- 
ganism as a set of behavior possibilities. 
(See Fig. 1.) And this I would hold 
to be true whether the given instance 
be a case of hunger, or of sex, or of the 
mere opening of a door or of the mere 
removing of a speck from the floor. 
The immediate process in each instance 
would be some sort of a “closure”-like, 
or “better figure”-like phenomenon tak- 
ing place in the immediately registered 
(i.e., apprehended) behavior field.² A
valenced goal is registered, apprehended,
or valenced goals are apprehended, as
located there or there in the field. And
the organism apprehends also what I
shall call a “need-push” in his “behav-
ing self”; this behaving self with its
need-push is likewise apprehended and
apprehended as located at its own dif-
ferent place in the behavior field. The
apprehended need-push pushes the ap-
prehended behaving self towards the
apprehended valenced goal or goals.
The result is a bringing of the behav-
ing self with its need-push and the va-
lenced goal or goals nearer together
either by the organism’s actually loco-
moting towards the goal or (if locomo-
tion proves impossible or not permitted
or if the motivation be irrelevantly en-
hanced) by some sort of a cognitive
distortion of the apprehended distance
between the behaving self and the goal.³

[Fig. 1. Diagram of a PERSONALITY STRUCTURE and a BEHAVIOR FIELD. The
personality structure contains: drive-compartments (with more or less
tension in them at the moment); goal-object and subgoal-object
compartments; channels (cathexes and beliefs) between drive-compartments
and goal and subgoal compartments; and resulting values in these latter
compartments. The behavior field contains the Behaving Self and G1 and
G2, which are apprehended goal-objects with plus valences.]

2 From here on I shall, because I think it is
more euphonious, substitute “apprehended”
for “registered.” But by “apprehended” I do
not mean a necessarily conscious process any
more than I would by the term “registered.”
“Apprehended” will mean, as I use it, some-
thing grasped by or registered on the nervous
system and from my point of view such a
registering or apprehension may be, and I
suspect quite often is, wholly unaccompanied
by consciousness.

However, while I would thus agree 
that the functioning of any want, 
whether basic or superficial, always 
obeys some final “dynamic” field laws, 
I would also hold that for the com- 
plete understanding of why a want ap- 
pears and functions in a given case 
there have to be considered also the 
peculiar physiological, glandular, nu- 
tritional and past training effects act- 
ing upon the particular individual at 
the particular time. And it is when 
one turns to these individual, often his- 
torical, determiners that one finds it 
useful to conceive not only of immedi- 
ately apprehended behavior fields but 
also of more enduring factors in the 
organism which I would summarize un-
der the concept of a personality struc-
ture. Such a personality structure
would be a complex of underlying
“drives” or “needs” plus attachments
of these drives or needs (as a result
of past experience) to goals and sub- 
goals. (See Fig. 1.) In short, one 
usually has to consider a relatively en- 
during personality structure with its in- 
lying drives, drive-attachments and re- 
sultant values to explain why in any 
given instance the need-pushes and the 
valences in the given behavior-field are 
what they are. Individual constitution, 
individual history and present-acting 
metabolic or glandular conditions seem 
to be especially important for determin- 
ing the strengths of present drives, the 
strengths of the attachments of such 
drives, and the final resultant values. 

3 In this connection we may recall the
Crutchfield experiments (1) in which rats
after being trained to run a certain distance
under one degree of hunger overshot the mark
when their hunger was increased.

I would assume, that is, some list of
innate drives or needs, although I do
not know now just what this list will
finally turn out to be. Each basic drive
or need I would suppose to have differ-
ent average strengths for different indi-
viduals due to differences of innate con-
stitution ⁴ and due also, perhaps, to dif-
ferences of early training. I would
assume further that in any one indi- 
vidual a given drive may suffer (some 
types of drives more than others) pe- 
riodic enhancements and diminutions 
due to varying nutritional and glandular 
conditions and disturbances. Such tem- 
poral variations in the strengths of the 
drives are particularly obvious in the 
case of what have been called by Mur-
ray (19) the “viscerogenic needs,” i.e.,
hunger, thirst, sex, and the like. Such 
viscerogenic needs or viscerogenic hun- 
gers exhibit cyclical rhythms depend- 
ent upon physiological processes and 
upon the interactions of environment- 
ally controlled maintenance-schedules 
with these physiological processes. For 
example, we in America, as a result 
of physiology plus culturally imposed 
maintenance-schedules, normally tend 
to become food-hungry three times a 
day. And what holds for food-hunger 
tends to hold in somewhat analogous 
fashion for the other viscerogenic hun- 
gers, such as: thirst, sex, maternal drive 
(in lactating mothers), toilet-seeking, 
and the rest. 

My next point would be that the 
drives are attached, “channeled,” or as 
I shall call it from now on “cathected,” 
to specific types of final goal-object 
either innately or as a result of experi- 
ence. Further, I shall hold that addi- 
tional channelings develop between such 
types of potential final goal-objects and 
still other types of objects as possible 
means or subgoals. These latter chan- 
nelings I shall term “beliefs” (mostly 
instrumental in character) and I shall
maintain that it is the cathexes and
beliefs constituting relatively simple or
relatively long and complex chains of
connections to the underlying drives
which give rise finally (together with
the presented stimuli plus the laws of
perception) to the apprehensions of
need-pushes, of valences, and of direc-
tions and distances in any actual be-
havior field.

4 See Calvin S. Hall’s experiments on breed-
ing strains of rats with different average
strengths of timidity and different average
strengths of aggressiveness (6, 7).

Return, for the moment, to the situa- 
tional wants, which are emphasized by 
our neo-Gestaltist friends—such wants, 
say, to list a few more, as those to 
“sharpen a pencil,” to “unravel a piece 
of string,” to “sing a specific tune,” to 
“select and buy a new shirt,” and the 
like. All such wants are conceived by 
me as having to be traced back in the 
last analysis through cathexes and in- 
strumental or other types of belief to 
underlying drives or needs. However,
I hasten to add that such a tracing back
will often be relatively difficult or even
(for lack of complete historical knowl-
edge concerning the individual) to all
intents and purposes impossible. Fur-
thermore, such a tracing back, even if
achieved, may well yield but little use-
ful information. For, in most everyday
instances, practically all the individuals
in a given culture will have very similar
maintenance-schedules, will have been 
exposed to the same learning processes 
and hence will have all arrived at prac- 
tically the same sets of cathexes and 
instrumental and other beliefs, and will, 
therefore, respond in most ordinary 
situations with practically the same re- 
sultant need-pushes and valences. Put 
any middle class college male in a store 
before an array of shirts and he will 
behave practically like every other 
middle class college male. That is, his 
behavior can, for all practical pur- 
poses, be considered as governed wholly 
by the immediate situation and not by 
any deep-lying drives and cathexes and 
instrumental or other beliefs. Given a 
common culture and typical individuals
in that culture and the everyday goals 
and subgoals of all such individuals will 
be practically identical. Any attempted 
examination of the specific drives and 
cathexes and beliefs underlying one of 
these individuals’ choices may well,
therefore, seem superfluous and silly. 
For practically all variations in the 
behaviors of these individuals will be 
correlated primarily with differences in 
the environment. There will be no 
need of taking into account basic drives 
and cathexes and beliefs. For these will
be the same for all such individuals.


However, put very different sorts of 
persons into the same situation and a 
quite different picture appears. Thus, 
if we introduced into our haberdasher’s 
an unusually rich student, or an unusu- 
ally poor one, or a visiting chief from 
a native culture, or a very sexually 
aroused, showoff male, then each of 
these might well tend to select very dif- 
ferent sorts of shirts. And it would ap- 
pear at once that in order to explain 
these differences of preference (i.e., of 
resultant valence and of need-push) in 
these different men, it would be neces- 
sary to consider not only the objective 
environmental situation, which would 
be practically identical for them all, but 
also the differences in their basic drive 
strengths at the moment, and in their 
individual histories which would have 
produced different cathexes and differ- 
ent beliefs attached to such drives. 

But let us now analyze “cathexes” 
further. The term “cathexis” has, of 
course, been borrowed from Freud (4). 
I have, however, somewhat departed 
from his usage. By a cathexis I shall 
mean not the resultant loading of the 
given type of goal-object by the drive 
(this loading I shall call value, see be- 
low) but rather the innate or acquired 
connection (or channel) between the 
given drive, say a viscerogenic hunger 
(such as food-hunger or thirst-hunger,
or sex-hunger) and the given type of
goal-object. The result of such a 
cathexis (i.e., such a channel) will be 
that when the given drive is in force 
and this cathexis channel is deep, that 
is, highly permeable, there will be a 
flow of energy from the drive to the 
compartment for the given type of goal- 
object. And this will result in what I 
call a corresponding positive value for 
that type of goal-object. Such a posi- 
tive “value” will bring it about that 
when it, this value, is high, the given 
individual will at that moment be es- 
pecially likely: (1) to apprehend in his 
behavior space any concrete instance 
of such a goal-object provided by the 
environment; and (2) to endow such 
an instance with positive “valence.” It 
will be noted that I am using “value” 
to designate the positive readiness for 
a type, or class, of objects, and that I 
am using “valence” to designate the re- 
sultant positive readiness to go to a spe- 
cific instance of such a class of objects 
when such a specific instance is appre- 
hended in a particular behavior space. 
A value is carried around by the organ- 
ism. A valence is a specific embodi- 
ment of this value in an apprehended 
concrete behavior space. 

Consider next “beliefs” and for the 
present primarily instrumental beliefs. 
It is my contention that the organism 
in addition to carrying around in his 
personality structure attachments of
types of goal-object to drives (i.e.,
cathexes) also carries around attach-
ments of types of subgoal to types of
final goal or to types of further sub- 
goal. It is all these attachments, or 
channels, which I call “instrumental be-
liefs.” These “instrumental beliefs”
are, I believe, primarily a product of 
learning. But further experimentation 
needs to be done. Examples of some 
of the probably millions of common, 
everyday instrumental beliefs which 
practically every middle-class Ameri-
can has learned and carries about
with him might be: “restaurants and/or 
kitchens lead to food,” “schools lead to 
education,” “the paths of speeding 
automobiles lead to injury,” “gas sta- 
tions lead to toilets,” “electric switches 
lead to lights,” and so on and so on and 
so on. 

In addition, however, to (or perhaps 
better as a sort of encompassing matrix 
for) many such relatively specific in- 
strumental beliefs there would be still 
more highly generalized, more abstract 
and in human beings eventually codi- 
fied and highly elaborated sets of be- 
liefs or rules which an individual in- 
herits or acquires. These would be gen- 
eral laws, such as those of space, time, 
quantity, causation, verbal logic, etc. 
Even animals have some such sub- 
stratum of such general innate or ac- 
quired rules about space, time, quan- 
tity, and quality. The animal’s general 
rules are simpler than ours and are not 
elaborated into mathematical or logical 
systems. But, nevertheless, a consid- 
erable part of what we seem to be in- 
terested in doing in our animal studies 
is in discovering just what general no- 
tions, rules, i.e., “instrumental beliefs,” 
as to space, time, quantity, gravity, and 
qualities our rats and chimpanzees do 
have or are capable of acquiring.® 

But consider now, more specifically, 
the motivational significance of such 
rules or instrumental beliefs, general or 
specific. My proposition is that when- 
ever the far-terminal of an instrumental 
belief is positively valued, the near- 
terminal of the belief becomes by virtue 
of this belief (i.e., channel) also posi- 
tively valued. In other words, instru- 
mental beliefs are similar to cathexes 


5 What I am here calling “general” instru- 
mental beliefs I called in a previous article 
(27) “field cognition modes.” These general 
beliefs or field cognition modes would also 
seem to be closely related to what Harlow 
(8) has recently called learning sets. 




in that they are to be conceived as in 
the nature of channels through which 
values may be conducted. Whereas a 
cathexis may be said to be a final in- 
strumental belief which operates to 
bring it about, that if some type of 
drive-relief is positively valued, a cer- 
tain type of final goal-object will there- 
fore also be positively valued, an in- 
strumental belief (or it may be that a 
whole nexus of instrumental beliefs or 
a belief-set) operates to bring it about 
that if some such type of final goal- 
object is positively valued, then such 
and such types of instrumentally con- 
nected subgoals will also be valued. 
By this time the reader must have 
become restive and be asking himself 
whether I have any possible objective, 
operational ways of defining these con- 
structs of drives, values, cathexes, be- 
liefs and the other constructs which I 
have not yet talked much about, of 
apprehended behavior fields, behaving-
selves, valences and need-pushes. I do
not pretend that I have any good final
definitions. I can, however, suggest the 
directions in which, I think, such defini- 
tions should be searched for. 


(1) Drives (or needs). The opera- 
tional definition of a drive, or need, is 
to be found, I believe, in the propensity 
of an individual to perform a character- 
istic type of consummatory response. 
Thus, food hunger would be defined by 
the propensity for eating, thirst by that 
for drinking, sex by that for the sex 
act, dominance by that for performing 
the act of dominating over other indi- 
viduals, maternal drive by that for the 
act of nursing or otherwise caring for 
the young, curiosity by that for the 
acts of investigating and examining, 
and so on.⁶ Thus one possible empirical


6 This fact that I am identifying drives by
the character of consummatory responses,
means, I hope, that I am escaping the pitfall
of finally ending up with an innumerable
number of drives. For there are, I believe,

technique for identifying, and
measuring the strength of, any such 
drive or need would consist in putting 
the individual in a “standardized situa- 
tion” in the direct presence of a “stand- 
ard” goal-object appropriate for the 
given drive and then noting the appear- 
ance of the consummatory response 
itself and measuring it quantitatively 
by its vigor, duration, frequency, la- 
tency, or the like. A second possible 
technique for measuring the strength of 
the drive would consist in putting the 
individual in a situation with a stand- 
ardized distance, or obstacle, between 
him and the standard goal-object and 
measuring some quantitative aspect, 
again such as frequency, latency, dura- 
tion, etc., not of his consummatory re- 
sponse as such, but of his approach-to- 
the-goal response. It is, of course, 
this latter approach technique which is 
used by the Columbia Obstruction Box, 
whereas it seems to be rather the 
strength of the consummatory response 
itself which was used by Wiesner and 
Sheard (29) in their counting of the 
number of retrievings of young to the 
nest which were carried out by a 
mother rat; and also by Maslow (17) 
and Nowlis (21) when they rated 
dominance in the monkeys and chim- 
panzees by pairing each animal with 
every other and observing in each such 
pair which animal dominated over the 
other in the situations of getting to 
food or to the home cage. 
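
The round-robin pairing procedure just described lends itself to a simple tally. What follows is only an illustrative sketch and is not taken from the studies cited: the animal labels, the `observed` outcomes, and the function name `dominance_scores` are all hypothetical, and it is assumed that the number of pairings an animal wins can serve as its dominance score.

```python
from itertools import combinations


def dominance_scores(animals, winner_of):
    """Pair every animal with every other and count pairings won.

    `winner_of(a, b)` is a hypothetical callback returning whichever
    of the two animals was observed to dominate in that pairing.
    """
    scores = {animal: 0 for animal in animals}
    for a, b in combinations(animals, 2):
        scores[winner_of(a, b)] += 1
    return scores


# Invented observations: which animal got to the food first in each pair.
observed = {
    frozenset({"A", "B"}): "A",
    frozenset({"A", "C"}): "A",
    frozenset({"B", "C"}): "C",
}

scores = dominance_scores(
    ["A", "B", "C"],
    lambda a, b: observed[frozenset({a, b})],
)
# Rank animals from most to least dominant by pairings won.
ranking = sorted(scores, key=scores.get, reverse=True)
```

A count of this kind gives at best an ordinal ranking; whether equal differences in the count mean equal differences in drive strength is a separate question of scale-units.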

Of course many problems and diffi- 
culties, involved in such attempts to 
measure the strengths of drives by 
strengths of consummatory responses 
or of approach responses, immediately 


only a relatively limited number of distinc- 
tive consummatory responses. There will not 
be as many drives as there are types of goal- 
object but only as many drives as there are 
distinctive different ways of behaving to goal- 
objects. And I believe we may some day be 
willing to settle for some number not much 
more than twenty.

arise. There is the problem of scale-units.
Do twenty crossings of the grill
in twenty minutes indicate a drive 
which is twice as strong as one which 
evokes only ten crossings in twenty 
minutes? But this problem of the 
equality of scale-units is one which has 
plagued all attempts at psychological 
measurement and is not peculiar to 
such measurements as these of drives. 
It is equally a problem for the hard-headed
psycho-physicists and the hard-headed
mental testers.

A second type of difficulty arises out 
of whether the conditions introduced 
into the measuring situation itself— 
such as, for example, the use of an 
electric grill in the Columbia Box— 
may not introduce an interacting and 
hence falsifying factor. Again, how- 
ever, this is a question which can be 
attacked and eventually solved only by 
thinking up other ways of measuring. 
It is the question as to whether, or in 
how far, the nature of one’s measuring 
instruments and procedures themselves 
enter into and distort the measure- 
ments. This is a problem for physics 
as well as for psychology. (Consider, 
for example, the Heisenberg Uncer- 
tainty Principle.) We can make our 
final decisions about it only by invent- 
ing other types of measuring stick and 
seeing whether or not they give the 
same or seemingly better, more mean- 
ingful, results and concepts. 

Yet again, another difficulty, espe- 
cially in the case of human beings, is 
that these latter tiresome animals seem 
to be able to cover up, repress, inhibit, 
or distort their drives so that when you 
put them in the presence of the stand- 
ard goal-object they refuse to go to or 
consume it, in any obvious, overtly 
measurable way—in spite of the fact 
that you know that they do suffer from 
the given drive or need. Thus, for ex- 
ample, one can put a hungry, but 
overly well-mannered child in the presence
of food, and he may refuse to eat.
And it is a commonplace that what can 
thus be brought about in our culture by 
social prohibitions in the case of hunger
can be brought about much more seri- 
ously in the cases of sex and aggres- 
sion. It is, of course, in the attempt to 
meet such difficulties that the projective 
techniques have been invented; but as 
ways of measuring momentary drive 
strengths these projective techniques 
are, of course, still far from satisfac- 
tory. Nevertheless, they do offer a 
beginning (see, for example, the ex- 
periment of Sanford [24] which de- 
veloped a projective test for measuring 
immediate hunger). 

(2) Turn now to our next concept— 
that of values. A positive value (and 
in the present paper I am considering 
positive values only, although a com- 
plete discussion would necessarily in- 
volve negative ones as well) I would 
measure operationally by the magni- 
tude of the propensity to get to a spe- 
cific type of goal object. Some quan- 
titative aspect of this getting-to would 
constitute my measure of a positive 
value. But it will be remembered that 
such a measuring of the magnitude of 
the getting-to response was one of the 
techniques just suggested for measur- 
ing the strength of the drive. Hence, 
in the case in which the type of ob- 
ject in question is the standard goal-ob- 
ject, the quantitative magnitudes of the 
strength of the drive and the quantita- 
tive magnitudes of the positive values 
of the goal-object will be one and the 
same. (The significance of this will be 
clearer in a moment, I hope.) 

Both concepts, that of drive and that 
of value of goal-object, are needed in 
order to allow for types of goal which 
evoke weaker approach responses than 
does the standard goal. The hungry 
rat approaches a standard food with a 
certain degree of vigor. This will operationally
measure the strength of his hunger
drive. He approaches other foods,
however, with either less or more vigor. 
These latter foods will have smaller or 
larger positive values given the same 
strengths of drive. The strength of the 
hunger drive at any given moment is 
defined by the vigor with which the 
individual will approach the standard 
food. And the vigor with which he ap- 
proaches any non-standard food will 
measure his value for this other non- 
standard food. 

Consider also the sort of case in
which a given type of object serves as
goal for more than one drive. The 
strength of the approach propensities 
(the value) of such an object may then 
prove exceptionally strong. For ex- 
ample, a given kind of food may have 
prestige value in addition to hunger 
value and hence be gone to more 
strongly than the standard food. A 
child’s value for ice cream may per- 
haps be an example of this. Ice cream 
may be approached both as food and 
as a socially approved prestige goal. 

(3) Consider next operational defi- 
nitions for cathexes and for instrumen- 
tal and other beliefs. These are all, as 
I have said, to be conceived as hy- 
pothetical channelings between succes- 
sive value compartments in the person- 
ality structure. These channels act so 
that when one compartment is filled 
with value any other compartment con- 
nected to it by such a channel also tends 
to become filled. 

A. Cathexes. A cathexis is a chan- 
nel between a drive-subsidence com- 
partment and a goal-object compart- 
ment. Our immediate task is how to 
arrive at an operational definition and 
measurement of such a channel. It is 
to be recalled that we suggested that 
we could define and measure the
strength of a drive by measuring the
getting-to responses relative to the 
“standard” goal for that drive. Another
way to put this is to say that, by
definition, I am asserting that the degree
of cathexis for a standard object
always has the value of “one.” For, if 
we plot different magnitudes of the 
drive as strengths of getting to a stand- 
ard goal as our abscissae and plot the 
corresponding same magnitudes as the 
positive values of the standard goal as
our ordinates, we would necessarily 
(since both sets of figures will be the 
same) get a straight line at a 45° 
angle. And the tangent of this angle 
would have the value of one. In other 
words, I shall define operationally the 
strength of the cathexis to any type of 
goal by the tangent of the angle made 
by the line resulting from plotting the 
magnitudes of the measures for getting- 
to such a goal against the magnitudes 
of the measures for getting-to the stand- 
ard goal. I shall assume, further, that 
goals other than the standard may be 
cathected either more or less strongly. 
Hence the slopes of the lines for those 
other goals will be greater or less than 
45° and the tangents of the angles cor- 
responding to them either greater or 
less than one. To recapitulate—the 
standard goal will, by definition, have 
a cathexis of one and other types of 
goals will have cathexes either greater 
or less than one. 
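
The slope definition of cathexis can be made concrete with a small numerical sketch. It assumes, beyond what the text states, that the plotted line passes through the origin and that a least-squares fit is an acceptable way of obtaining its slope. The function names and the numbers are invented purely for illustration.

```python
import math


def cathexis(standard, other):
    """Slope (tangent of the angle) of the least-squares line through the origin.

    `standard` holds getting-to measures for the standard goal (abscissae);
    `other` holds the corresponding measures for the goal being assessed
    (ordinates). Fitting through the origin is an assumption of this sketch.
    """
    numerator = sum(x * y for x, y in zip(standard, other))
    denominator = sum(x * x for x in standard)
    return numerator / denominator


def angle_degrees(slope):
    """Angle of the fitted line with the abscissa, in degrees."""
    return math.degrees(math.atan(slope))


# Invented measures at three drive levels.
standard_goal = [2.0, 4.0, 6.0]
weaker_goal = [1.0, 2.0, 3.0]

unit_cathexis = cathexis(standard_goal, standard_goal)  # standard against itself
weak_cathexis = cathexis(standard_goal, weaker_goal)    # a less strongly cathected goal
```

Plotting the standard goal against itself gives a slope of one (a 45° line) by construction, while the second set of invented measures gives a slope below one.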

B. Instrumental beliefs. An instru- 
mental belief I conceive as in the na- 
ture of a channel (or possibly a set of 
channels) between the compartment for 
a type of goal and that for a type of 
means or subgoal, or as a channel be- 
tween two successive subgoals. If the 
more final one of these goals or sub- 
goals has positive value, the nearer, less 
final one, will also have positive value 
conducted to it depending upon the 
strength of the belief (i.e., the good- 
ness of the channel). How do we op- 
erationally discover and measure such 
beliefs? Theoretically the answer is 
simple. Practically, however, given the 
complexity of the human animal and
its covering-up capacities, the uncovering
of a man’s beliefs and the attempt
to measure them is usually a very tricky 
business, for which at present we have 
mostly only relatively indirect and fal- 
lible procedures, such as interviews and 
projective techniques. Theoretically, we 
can proceed after the analogy of ca- 
thexes. We can discover that the posi- 
tive values of object Type X (that is, 
the propensity to get to this x type of 
object) go up and down with the posi- 
tive values of object Type Y. We plot 
the one set of values against the other 
and get the slope of the curve. This 
slope will usually vary from zero to a 
maximum of 45 degrees, in which latter 
case the instrumental belief (i.e., “that 
x’s are the way to get to y’s”) may be 
said to be 100 per cent, in the sense 
that the organism believes completely 
and absolutely that the reaching of an 
object of Type X will lead him to an 
object of Type Y.⁷
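
Taken together, the slope definitions of cathexis and of instrumental belief suggest how value might be conducted along a chain of channels. The sketch below is one possible reading and not a formula given in the text: it assumes that each channel simply multiplies the value passed along it by its slope, with a belief of 100 per cent written as 1.0. The function name and the numbers are hypothetical.

```python
def conducted_values(drive_strength, cathexis, beliefs):
    """Values of the final goal and of each successively nearer subgoal.

    Assumes (for illustration only) that every channel is linear:
    goal value = cathexis * drive strength, and each subgoal value =
    belief strength * value of the more final goal or subgoal it leads to.
    `beliefs` is ordered from the final goal outward toward nearer subgoals.
    """
    value = drive_strength * cathexis
    chain = [value]
    for belief in beliefs:
        value *= belief
        chain.append(value)
    return chain


# Invented case: a fully believed first subgoal and a half-believed second one.
chain = conducted_values(drive_strength=10.0, cathexis=0.8, beliefs=[1.0, 0.5])
```

Under this reading a belief of 100 per cent passes the goal's value on undiminished, and weaker beliefs pass on proportionally less.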

But any such certain and neatly iso- 
lated instrumental beliefs are probably 
very rare in human beings. Rather, 
given our shifting experiences and our 
complicated personality structures, it 
becomes very difficult to uncover what 
are the manifolds of instrumental be- 
liefs upon which, for example, your 
and my respective positive values for 
the study of psychology depend. De- 
tailed case histories, projective tech- 
niques, and depth interviews would all 
have to be tried, and even then the 
final plotting out of your and my re- 
spective belief systems would still be 
largely a product of clinical insight and 


7 The intriguing possibility also suggests it- 
self that fixations, stereotypes, and prejudices 
may perhaps be considered as beliefs that are 
more than 100 per cent strong. In such cases 
the values for x’s would be stronger than the 
values for y’s even though they depend upon 
the latter. Finally, it may be noted that 
what I am here calling “instrumental beliefs” 
I formerly called “means-end readinesses”

intuition. The situation would be the
same as that in which students of 
prejudice and ethnocentrism have found 
themselves involved. They have dis- 
covered (see, for example, the recent 
articles by Else Frenkel-Brunswik [7] 
and Frenkel-Brunswik, Levinson and 
Sanford [3]) that in seeking to dis- 
cover why some individuals have strong 
negative values for minority groups and 
others have positive values for them, 
they, the investigators, had to uncover, 
by all the depth methods at their com- 
mand, the values of their subjects with 
respect to a wide range of other entities, 
especially their plus and minus values 
with respect to parents and authority 
figures, in-groups and out-groups, sex- 
roles, manners, crime, astrology, acts of 
aggression, etc., etc., etc. And, then, 
the final spelling out of the instrumental 
or other belief-connections between the 
values for all these other entities and 
the resultant positive value for majority 
groups seemed to be a matter of good 
sense and intuition and not one as yet 
capable of precise experimental confirmation.⁸

C. Finally, at this point, I need to 
introduce an additional concept, that 
of equivalence beliefs. (See also [27].) 
For it would appear that when we try
to untangle the nexus of connections by 
which such a positive value as that for 
“being a psychologist” or that for “one’s 
own group as contrasted with minority 
groups” is determined in a given indi- 
vidual, we seem oftentimes to find that 
the value for one type of entity derives 
from that for another type of entity not 
as a result of a true instrumental relationship


8 At this point it should again be emphasized
that a complete statement of the doctrine
I am presenting would require the inclusion
of negative values (for example, the
negative values for minority groups). I have
had to leave a discussion of negative values
out of this account for lack of time. But my
system with a little more elaboration can
really take care of them.


but rather from the acceptance
of an identification, i.e., an equivalence, 
between the two. The operational evi- 
dence for such an equivalence belief 
would be found (1) in the fact that, as 
in the case of an instrumental belief, 
the positive value for the one type of 
entity would be found to go up and 
down with that for another type of 
entity, but also (2) in the fact (not 
true for an instrumental belief) that 
when an instance of the equivalent 
entity is reached, there would be a re- 
duced remaining propensity to go on to 
an instance of the original entity to 
which it is equivalent. The equivalent 
would, in short, be accepted, as in some 
degree, a substitute for that for which 
it is taken as equivalent, in a way in 
which a mere subgoal or means is not 
accepted as a substitute for the final 
goal.
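
The two pieces of operational evidence just listed can be set out as a simple decision rule. This is only a schematic sketch: the function name, the `tolerance` parameter, and the use of a single before-and-after comparison of the propensity to go on to the original entity are assumptions made for illustration, and nothing here settles how those propensities would actually be measured.

```python
def classify_belief(values_covary, propensity_before, propensity_after,
                    tolerance=0.0):
    """Apply the two criteria to a pair of entity types.

    values_covary:     criterion (1), the value for one entity goes up and
                       down with the value for the other.
    propensity_before: propensity to go on to the original entity before an
                       instance of the other entity is reached.
    propensity_after:  the same propensity after such an instance is reached.
    tolerance:         hypothetical margin below which a drop is ignored.
    """
    if not values_covary:
        return "no belief indicated"
    # Criterion (2): a reduced remaining propensity marks an equivalence belief.
    if propensity_after < propensity_before - tolerance:
        return "equivalence"
    return "instrumental"


# Invented readings for three hypothetical cases.
substitute_case = classify_belief(True, 5.0, 2.0)
subgoal_case = classify_belief(True, 5.0, 5.0)
unrelated_case = classify_belief(False, 5.0, 2.0)
```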


This last point emphasizes a feature
about my whole conceptual system concerning
drives and values which has not
been made sufficiently explicit. It ap- 
pears, in short, that when a type of 
entity has positive value because it is 
either a type of goal or a type of sub- 
goal, this means not only that there is 
a propensity to get to any instance of 
it and to experience an immediate dis- 
charge in the fact of getting to such an 
instance, but also that there remains a 
propensity to use such an instance to 
go on to instances of the next subgoal 
or goal in the instrumental hierarchy.
Even an instance of a final goal pro- 
vides not merely a discharge of tension 
upon being reached, but it is then also 
manipulated by the consummatory ac- 
tivity as the way to arrive at the still 
more ultimate goal of drive subsidence. 
In the case of an equivalence belief, on 
the other hand, getting to and manipu- 
lating the type of entity which is ac- 
cepted as “equivalent” relieves the 
need-push for this latter, even though 
it does not get the organism on towards
the final goal and does not help in relieving
his ultimate underlying drive
tension. Yet the organism may persist 
in getting to the “equivalent” because 
of his immediate relief, and consistently 
fail to discover that his underlying 
drive as such is not relieved. The indi- 
vidual seems to have, in short, as a re- 
sult of an equivalence belief, what has 
sometimes been called an insatiable
neurotic need. 

Finally, it is obvious that, if depth 
interviews and all kinds of fancy clini- 
cal techniques are necessary to unravel 
an individual’s specific instrumental be- 
liefs, such interviews and such fancy 
techniques will be all the more neces- 
sary to unravel the individual’s equiva- 
lence beliefs. 

(4) Finally, let me turn now, very 
briefly, to the operational definitions 
of my remaining concepts: the appre- 
hended behavior field with its qualities, 
directions and distances, the appre- 
hended behaving self, and apprehended 
valences and need-pushes. 

My concept of an apprehended be- 
havior field is patterned very closely 
after Lewin’s “outer life space” (14). 
The apprehended behavior field is the 
form in which any immediate environ- 
ment is perceived and/or remembered 
and/or inferred by the organism at a 
given time. Such a construct of a psy- 
chological life space or apprehended be- 
havior field as distinct from the true 
“geographical environment” (to use 
Koffka’s term [11]) undoubtedly arises 
in the first place as a result of the 
alarums and excursions of each indi- 
vidual’s own everyday experience. The 
environment appears to each of us to 
be so and so, but we examine further 
and discover the facts of illusion, and 
of our own perceptual, memorial and 
inferential insufficiencies. We discover 
the difference between appearance and 
reality—between our immediately ap- 
prehended behavior spaces and the real
environments, which latter we discover
from communal investigations by our- 
selves and our fellows, using meter 
sticks and all the paraphernalia of both 
the macroscopic and microscopic physi- 
cal and social sciences. When we turn 
to the study of the lower animals, it 
becomes more than obvious that the ap- 
prehended behavior fields of these ani- 
mals must be in many ways quite dif- 
ferent from ours—given one and the 
same geographical environments. In 
short, we are reinforced in the distinc- 
tion between the true geographical en- 
vironment of any individual X (man or 
animal) as described by a community 
of independent observers provided with 
the measuring tools of the sciences and 
the apprehended situation (or behavior 
field) of Individual X as it is indicated 
by his behavior in some specific seg- 
ment of time. 

But still the question remains as to 
how we can arrive at operational defi- 
nitions of such behavior fields and of 
their contents. 

Here it must be admitted we are at 
present in a pretty unsatisfactory stage. 
We again find ourselves today relying 
in large part on empathy, intuition, 
hunches, introspective phenomenology, 
and various other subjective, non-operationally
definable procedures. The beginnings
of a more objective, more operational
way for arriving at the probable
behavior fields for given organisms
at given times are, however, I believe,
slowly being developed, especially by
animal psychologists.

The probable perceptual qualities in 
a rat’s field are being arrived at by 
discrimination experiments which relate 
stimuli and patterns of stimuli (physi- 
cally defined) to discriminable differ- 
ences in behavior. We learn not only 
about the possible discriminanda in the 
rat, but also such facts as that a hori- 
zontal row of dots looks like a solid 
horizontal line (Krech, 12). When all
the necessary work has been done, we
will eventually be able to postulate what 
qualities, what discriminanda, arrayed 
in what Gestalten a rat will probably 
apprehend when presented with a given 
array of present and past stimuli. 

As to distances and directions, our 
maze work is just now getting to this 
point. It is to be hoped that in the 
near future with the use of sound-proof 
and light-proof domes (in which we can 
spot visual and auditory cues at will) 
we shall be able to discover the basic 
principles of a rat’s perception of spatial 
distances and directions. We shall be 
able to discover, that is, what arrays of 
spatially located cues are going to lead 
to such and such abilities or non-abili- 
ties to take a short-cut, to take an 
Umweg, to take the shorter of two 
paths, and the like. 

As to the rat’s or the chimpanzee’s 
perception of its behaving self, I confess 
I am stumped. However, I would guess 
that the uncovering of the principles of 
the perception of spatial distances and 
directions would probably also uncover 
something to the effect that these per- 
ceptions radiate out from some sort of 
an identified and moving locus, i.e., a 
behaving self. 

Next, as to need-pushes and valences.
The concept of need-pushes as perceived 
in the behaving self has been forced 
upon me by a number of considerations 
(including, I have to admit, my own 
introspection). A need-push is, so to 
speak, the activating push in a concrete 
behavior field toward a valenced goal 
and it may or may not correspond in 
amount to what, by other means, we 
would define as the underlying value 
resulting from the given drive or need. 
Thus, for example, there are some very 
surprising results beginning to appear 
with rats. It looks from some of the 
recent latent learning findings—includ- 
ing those of Spence and Lippitt (25), 
Kendler and Mencher (10) and of
Walker (28), and some things we are
now doing at California—that even 
though a rat’s drive be changed from 
thirst to hunger, his need-push may 
not, for a while, change correspond- 
ingly. The water place may still be 
valenced, even though the drive as 
measured by our original definition is 
now one of hunger. In other words, 
under some conditions, rats, and men 
too, do not seem to have need-pushes 
which correspond to their actual drives. 
(And also I would remark, parentheti- 
cally, they often experience valences 
which do not correspond to their actual 
values.) Why this happens, how much 
it may be a function of early childhood 
maintenance schedules involving dep- 
rivations or non-deprivations, or how 
much it may result from disrupting 
conflict situations such, say, as that in- 
volved in the use of doors in the origi- 
nal Spence and Lippitt (25) latent 
learning experiment and the forcing of 
the resistant animal down the path that
he didn’t want, seem to me very excit- 
ing questions. They open up vistas for 
all sorts of further intriguing experi- 
ments. In any event, whatever we find 
with rats, I am going to assert that 
often one of the main functions of the 
clinician working with human beings is 
merely to aid the patient to discover his 
true drives and values so that his need- 
pushes and valences may come more 
nearly to correspond to the former. 
Perhaps this is the principle which 
Rogers’ non-directive therapy has most
strongly emphasized and capitalized on.


Finally, I would like to reiterate 
something I said at the beginning. In 
the matter of the final functioning in 
the behavior field of need-pushes, direc- 
tions and distances and valences, the 
neo-Gestaltists are undoubtedly right in 
emphasizing that the resultant behav- 
ior, or the resultant perceptual changes 
in the field, certainly will be in the direction
of what they would call tension
reduction and “better figure” in the 
behavior space. 

Wants and cognitions as interfused 
activities in the concrete apprehended 
behavior-fields do, I believe, function 
according to neo-Gestalt laws (which 
laws have, of course, for the most part 
not yet been either precisely discovered 
or stated). But I still believe that a 
second system of concepts—those of 
drives, or needs, of values and of inter- 
connecting beliefs which the organism 
carries about with him in his person- 
ality structure (i.e., in Lewin’s inner 
life space) and which combine with the 
laws of perception of external stimuli 
to produce the actual resultant behavior 
field with its inlying qualities, direc- 
tions, distances, need-pushes, and va- 
lences—is necessary. I also believe 
that these drives, values, cathexes, and 
beliefs may obey more associational 
and less Gestalt-like laws than people 
like Krech and Crutchfield (13) and 
McLeod (15) would like to admit. I 
am not too insistent on this latter point, 
although certainly my tendency, and 
that of almost everybody else, to re- 
sort to such analogies as “channels” 
suggests associationism rather than 
Gestaltism. But the point I am in- 
sistent upon is that this problem of the 
drives and values and beliefs in the 
given personality structure is equally as 
necessary and important as is that of 
the dynamics of the immediate func- 
tioning of need-push and valence in the 
immediately apprehended behavior field. 
I shall, in short, continue to search for 
basic drives and continue to seek to 
analyze relatively enduring belief and 
value structures as something different 
from, and obeying different laws from, 
immediate fields and their dynamics— 
whether these fields be conceived phe- 
nomenologically or behavioristically. I 
shall persist to the end in being a drive- 
belief-value psychologist as well as a
behavior-field psychologist and never
will regress, or advance, to being a pure 
all-embracing field, or neo-Gestaltish, 
psychologist.